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Preface to the Second Edition 


The purpose and organisation of this book are described in the preface to the first 
edition (1988). In preparing this edition minor changes have been made, par- 
ticularly to Chap. 1 to keep it reasonably current. However, the rest of the book 
has required only minor modification to clarify the presentation and to modify 
or replace individual problems to make them more effective. The answers to the 
problems are available in Solutions Manual for Computational Techniques for 
Fluid Dynamics by C. A. J. Fletcher and K. Srinivas, published by Springer-Verlag, 
Heidelberg, 1991. The computer programs have also been reviewed and tidied up. 
These are available on an IBM-compatible floppy disc direct from the author. 

I would like to take this opportunity to thank the many readers for their usually 
generous comments about the first edition and particularly those readers who 
went to the trouble of drawing specific errors to my attention. In this revised edi- 
tion considerable effort has been made to remove a number of minor errors that 
had found their way into the original. I express the hope that no errors remain 
but welcome communication that will help me improve future editions. 

In preparing this revised edition I have received considerable help from Dr. K. 
Srinivas, Nam-Hyo Cho, Zili Zhu and Susan Gonzales at the University of Sydney 
and from Professor W. Beiglbóck and his colleagues at Springer-Verlag. I am very 
grateful to all of them. 


Sydney, November 1990 C. A. J. Fletcher 


Preface to the First Edition 


The purpose of this two-volume textbook is to provide students of engineering, 
science and applied mathematics with the specific techniques, and the framework 
to develop skill in using them, that have proven effective in the various branches 
of computational fluid dynamics (CFD). Volume 1 describes both fundamental 
and general techniques that are relevant to all branches of fluid flow. Volume 2 
provides specific techniques, applicable to the different categories of engineering 
flow behaviour, many of which are also appropriate to convective heat transfer. 

An underlying theme of the text is that the competing formulations which are 
suitable for computational fluid dynamics, eg. the finite difference, finite ele- 
ment, finite volume and spectral methods, are closely related and can be inter- 
preted as part of a unified structure. Classroom experience indicates that this ap- 
proach assists, considerably, the student in acquiring a deeper understanding of 
the strengths and weaknesses of the alternative computational methods. 

Through the provision of 24 computer programs and associated examples and 
problems, the present text is also suitable for established research workers and 
practitioners who wish to acquire computational skills without the benefit of for- 
mal instruction. The text includes the most up-to-date techniques and is sup- 
ported by more than 300 figures and 500 references. 

For the conventional student the contents of Vol. 1 are suitable for introduc- 
tory CFD courses at the final-year undergraduate or beginning graduate level. The 
contents of Vol. 2 are applicable to specialised graduate courses in the engineering 
CFD area. For the established research worker and practitioner it is recommended 
that Vol. 1 is read and the problems systematically solved before the individual’s 
CFD project is started, if possible. The contents of Vol. 2 are of greater value after 
the individual has gained some CFD experience with his own project. 

It is assumed that the reader is familiar with basic computational processes 
such as the solution of systems of linear algebraic equations, non-linear equations 
and ordinary differential equations. Such material is provided by Dahlquist, 
Bjorck and Anderson in Numerical Methods; by Forsythe, Malcolm and Moler 
in Computer Methods for Mathematical Computation; and by Carnaghan, 
Luther and Wilkes in Applied Numerical Analysis. It is also assumed that the 
reader has some knowledge of fluid dynamics. Such knowledge can be obtained 
from Fluid Mechanics by Streeter and Wylie; from An Indroduction of Fluid Dy- 
namics by Batchelor; or from Jncompressible Flow by Panton, amongst others. 

Computer programs are provided in the present text for guidance and to make 
it easier for the reader to write his own programs, either by using equivalent con- 
structions, or by modifying the programs provided. In the sense that the CFD 
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practitioner is as likely to inherit an existing code as to write his own from scratch, 
some practice in modifying existing, but simple, programs is desirable. An IBM- 
compatible floppy disk containing the computer programs may be obtained from 
the author. 

The contents of Vol. 1 are arranged in the following way. Chapter 1 contains 
an introduction to computational fluid dynamics, designed to give the reader an 
appreciation of why CFD is so important, the sort of problems it is capable of 
solving and an overview of how CFD is implemented. The equations governing 
fluid flow are usually expressed as partial differential equations. Chapter 2 de- 
scribes the different classes of partial differential equations and appropriate 
boundary conditions and briefly reviews traditional methods of solution. 

Obtaining computational solutions consists of two stages: the reduction of the 
partial differential equations to algebraic equations and the solution of the 
algebraic equations. The first stage, called discretisation, is examined in Chap. 3 
with special emphasis on the accuracy. Chapter 4 provides sufficient theoretical 
background to ensure that computational solutions can be related properly to the 
usually unknown “exact” solution. Weighted residual methods are introduced in 
Chap. 5 as a vehicle for investigating and comparing the finite element, finite 
volume and spectral methods as alternative means of discretisation. Specific tech- 
niques to solve the algebraic equations resulting from discretisation are described 
in Chap. 6. Chapters 3-6 provide essential background information. 

The one-dimensional diffusion equation, considered in Chap. 7, provides the 
simplest model for highly dissipative fluid flows. This equation is used to contrast 
explicit and implicit methods and to discuss the computational representation of 
derivative boundary conditions. If two or more spatial dimensions are present, 
splitting techniques are usually required to obtain computational solutions effi- 
ciently. Splitting techniques are described in Chap. 8. Convective (or advective) 
aspects of fluid flow, and their effective computational prediction, are examined 
in Chap. 9. The convective terms are usually nonlinear. The additional difficulties 
that this introduces are considered in Chap. 10. The general techniques, developed 
in Chaps. 7 — 10, are utilised in constructing specific techniques for the different 
categories of flow behaviour, as is demonstrated in Chaps. 14—18 of Vol. 2. 


In preparing this textbook I have been assisted by many people. In particular 
I would like to thank Dr. K. Srinivas, Nam-Hyo Cho and Zili Zhu for having read 
the text and made many helpful suggestions. I am grateful to June Jeffery for pro- 
ducing illustrations of a very high standard. Special thanks are due to Susan Gon- 
zales, Lyn Kennedy, Marichu Agudo and Shane Gorton for typing the manuscript 
and revisions with commendable accuracy, speed and equilibrium while coping 
with both an arbitrary author and recalcitrant word processors. 

It is a pleasure to acknowledge the thoughtful assistance and professional 
competence provided by Professor W. Beiglbóck, Ms. Christine Pendl, Mr. R. 
Michels and colleagues at Springer-Verlag in the production of this textbook. 
Finally I express deep gratitude to my wife, Mary, who has been unfailingly sup- 
portive while accepting the role of book-widow with her customary good grace. 


Sydney, October 1987 C. A. J. Fletcher 
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1. Computational Fluid Dynamics: Ап Introduction 


This chapter provides an overview of computational fluid dynamics (CFD) with 
emphasis on its cost-effectiveness in design. Some representative applications are 
described to indicate what CFD is capable of. The typical structure of the equations 
governing fluid dynamics is highlighted and the way in which these equations are 
converted into computer-executable algorithms is illustrated. Finally attention is 
drawn to some of the important sources of further information. 


1.1 Advantages of Computational Fluid Dynamics 


The establishment of the science of fluid dynamics and the practical application of 
that science has been under way since the time of Newton. The theoretical devel- 
opment of fluid dynamics focuses on the construction and solution of the governing 
equations for the different categories of fluid dynamics and the study of various 
approximations to those equations. ` 

The governing equations for Newtonian fluid dynamics, the unsteady Navier- 
Stokes equations, have been known for 150 years or more. However, the devel- 
opment of reduced forms of these equations (Chap. 16) is still an active area of 
research as is the turbulent closure problem for the Reynolds-averaged Navier- 
Stokes equations (Sect. 11.5.2), For non-Newtonian fluid dynamics, chemically 
reacting flows and two-phase flows the theoretical development is at a less advanced 
stage. B 

Experimental fluid dynamics has played an important role in validating and 
delineating the limits of the various approximations to the governing equations. The 
wind tunnel, as a piece of experimental equipment, provides an effective means of 
simulating real flows. Traditionally this has provided a cost-effective alternative to 
full-scale measurement. In the design of equipment that depends critically on the 
flow behaviour, e.g. aircraft design, full-scale measurement as part of the design 
process 15 economically unavailable. 

The steady improvement in the speed of computers and the memory size since 
the 1950s has led to the emergence of computational fluid dynamics (CFD). This 
талеп of uid dynamics complements experimental and theoretical fluid dynamics 

y providing an alternative cost-effective means of simulating real flows. As such it 


offers the means of testing theoretical advances for conditions unavailable exper- 
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imentally. For example wind tunnel experiments are limited to a certain range of 
Reynolds numbers, typically one or two orders of magnitude less than full scale. 

Computational fluid dynamics also provides the convenience of being able to 
switch off specific terms in the governing equations. This permits the testing of 
theoretical models and, inverting the connection, suggests new paths for theoretical 
exploration. 

The development of more efficient computers has generated the interest in CFD 
and, in turn, this has produced a dramatic improvement in the efficiency of the 
computational techniques. Consequently CFD is now the preferred means of 
testing alternative designs in many branches of the aircraft, flow machinery and, to 
a lesser extent, automobile industries. | 

Following Chapman et al. (1975), Chapman (1979, 1981), Green (1982), Rubbert 
(1986) and Jameson (1989) CFD provides five major advantages compared with 
experimental fluid dynamics: 


(i) Lead time in design and development is significantly reduced. 
(ii) CFD can simulate flow conditions not reproducible in experimental model 
tests. 
(ii) CFD provides more detailed and comprehensive information. 
(iv) CFD is increasingly more cost-effective than wind-tunnel testing. 
(v) CFD produces a lower energy consumption. 


Traditionally, large lead times have been caused by the necessary sequence of 
design, model construction, wind-tunnel testing and redesign. Model construction 
is often the slowest component. Using a well-developed CFD code allows al- 
ternative designs (different geometric configurations) to be run over a range of 
parameter values, e.g. Reynolds number, Mach number, flow orientation. Each 
case may require 15 min runs on a supercomputer, e.g. CRAY Y-MP. The design 
optimisation process is essentially limited by the ability of the designer to absorb 
and assess the computational results. In practice CFD is very effective in the early 
elimination of competing design configurations. Final design choices are still 
confirmed by wind-tunnel testing. 

Rubbert (1986) draws attention to the speed with which CFD can be used to 
redesign minor components, if the CFD packages have been thoroughly validated. 
Rubbert cites the example of the redesign of the external contour of the Boeing 757 
cab to accommodate the same cockpit components as the Boeing 767 to minimise 
pilot conversion time. Rubbert indicates that CFD provided the external shape 
which was incorporated into the production schedule before any wind-tunnel 
verification was undertaken. 

Wind-tunnel testing is typically limited in the Reynolds number it can achieve, 
usually short of full scale. Very high temperatures associated with coupled heat 
transfer fluid flow problems are beyond the scope of many experimental facilities. 
This is particularly true of combustion problems where the changing chemical 
composition adds another level of complexity. Some categories of unsteady flow 
motion cannot be properly modelled experimentally, particularly where geometric 
unsteadiness occurs as in certain categories of biological fluid dynamics. Many 


1.1 Advantages of Computational Fluid Dynamics 3 


geophysical fluid dynamic problems are too big or too remote in space or time to 
simulate experimentally. Thus oil reservoir flows are generally inaccessible to 
detailed experimental measurement. Problems of astrophysical fluid dynamics are 
too remote spatially and weather patterns must be predicted before they occur. All 
of these categories of fluid motion are amenable to the computational approach. 

Experimental facilities, such as wind tunnels, are very effective for obtaining 
global information, such as the complete lift and drag on a body and the surface 
pressure distributions at key locations. However, to obtain detailed velocity and 
pressure distributions throughout the region surrounding a body would be pro- 
hibitively expensive and very time consuming. CFD provides this detailed in- 
formation at no additional cost and consequently permits a more precise under- 
standing of the flow processes to be obtained. 

Perhaps the most important reason for the growth of CFD is that for much 
mainstream flow simulation, CFD is significantly cheaper than wind-tunnel testing 
and will become even more so in the future. Improvements in computer hardware 
performance have occurred hand in hand with a decreasing hardware cost. 
Consequently for a given numerical algorithm and flow problem the relative cost of 
a computational simulation has decreased significantly historically (Fig. 1.1). Par- 
alleling the improvement in computer hardware has been the improvement in the 
efficiency of computational algorithms for a given problem. Current improvements 
in hardware cost and computational algorithm efficiency show no obvious sign of 
reaching a limit. Consequently these two factors combine to make CFD increas- 
ingly cost-effective. In contrast the cost of performing experiments continues to 
increase. 

The improvement in computer hardware and numerical algorithms has also 
brought about a reduction in energy consumption to obtain computational flow 
simulations. Conversely, the need to simulate more extreme physical conditions, 
higher Reynolds number, higher Mach number, higher temperature, has brought 
about an increase in energy consumption associated with experimental testing. 

The chronological development of computers over the last thirty years has been 
towards faster machines with larger memories. A modern supercomputer such as 
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the CRAY Y-MP is capable of operating at more than 2000 Megaflops (Dongarra 
1989). A Megaflop is one million floating-point arithmetic operations per second. 
More recent supercomputers, e.g. the NEC SX3, are capable of theoretical speeds 
of 20000 Megaflops. The speed comes partly from a short machine cycle time, that 
is the time required for each cycle of logic operations. The CRAY Y-MP has a cycle 
time of 6 nanoseconds (6 x 1079 s) whereas the NEC SX3 has a cycle time of 2.9 ns. 

А specific operation, e.g. a floating point addition, can be broken up into a 
number of logic operations each one of which requires one machine cycle to 
execute. If the same operation, e.g. floating point addition, is to be applied 
sequentially to a large number of elements in a vector, it is desirable to treat each 
logic operation sequentially but to permit different logic operations associated with 
each vector element to be executed concurrently. Thus there is a considerable 
overlap and a considerable speed-up in the overall execution time if the com- 
putational algorithm can exploit such a pipeline arrangement. 

Modern supercomputers have special vector processors that utilise the pipeline 
format. However vector processors have an effective "start-up" time that makes 
them slower than scalar processors for very short vectors. One can define а break- 
even vector length, N,, for which the vector processor has the same speed as a 
scalar processor. For very long vectors (N — oo) the theoretical vector processor 
speed is achieved. 

To compare the efficiency of different vector-processing computers it is (almost) 
standard practice to consider №, (after Hockney and Jesshope 1981), which is the 
vector length for which half the asymptotic peak vector processing performance 
(№ = оо) is achieved. The actual N,,. is dependent on the specific operations being 
performed as well as the hardware. For а SAXPY operation (S— AX + Y), 
N,,5 37 for a CRAY X-MP and N,,; =238 for a CYBER 205. For most modern 
supercomputers, 30 < М, < 100. 

The speed-up due to vectorisation is quantifiable by considering Amdahl’s law 
which can be expressed as (Gentzsch and Neves 1988) 


G-[(1—P)--P/R]! and R=V(N)/S (1.1) 


where С is the overall gain in speed of the process (overall speed-up ratio) 
V(N) is the vector processor speed for an N component vector process 

S is the scalar processor speed for a single component process 

P is the proportion of the process that is vectorized and 

R is the vector processor speed-up ratio. 

As is indicated in Fig. 1.2 a vector processor with a theoretical (N = oo) vector 
speed-up ratio, R=10, must achieve a high percentage vectorisation, say P >0.75, 
to produce a significant overall speed-up ratio, С. But at this level 0G/0P > 0G/0R. 
Thus modification of the computer program to increase P will provide a much 
bigger increase in G than modifying the hardware to increase V and hence R. In 
addition unless a large proportion of the computer program can be written so that 
vector lengths are significantly greater than №, ,2, the overall speed-up ratio, С, will 
not be very great. 
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The ability to increase the overall execution speed to the limit set by the 
hardware depends partly on the ability of the operating system and compiler to 
vectorise the computational algorithm and partly on choosing computational 
algorithms that are inherently vectorisable (Ortega and Voigt 1985). The struc- 
turing of computational algorithms to permit vectorisation is an important 
research topic but is beyond the scope of this book (see Gentzsch and Neves 1988). 
The long term trend would appear to be towards making the operating system and 
compiler take care of the vectorisation with less emphasis on the user having to 
manipulate the basic algorithm. 

With a pipeline architecture, an efficient vector instruction set and as small a 
cycle time as possible the major means of further increasing the processing speed is 
to introduce multiple processors operating in parallel. Supercomputers are typ- 
ically being designed with up to sixteen processors in parallel. Theoretically this 
should provide up to a factor of sixteen improvement in speed. Experiments by 
Grassl and Schwarzmeier (1990) with an eight-processor CRAY Y-MP indicate 
that 84% of the theoretical improvement can be achieved for a typical CFD code 
such as ARC3D (Vol. 2, Sect. 18.4.1). 

The concept of an array of processors each operating on an element of a vector 
has been an important feature in the development of more efficient computer 
architecture (Hockney and Jesshope 1981). Тһе Шас IV had 64 parallel processors 
and achieved an overall processing speed comparable to the CRAY-1 and CYBER- 
205 even though the cycle time was only 80 ns. However Amdahl’s law, (1.1), also 
applies to parallel processors if R is replaced by N,, the number of parallel 
processors, and P is the proportion of the process that is parallelisable. The relative 
merits of pipeline and parallel processing are discussed in general terms by Levine 
(1982), Ortega and Voigt (1985) and in more detail by Hockney and Jesshope (1981) 
and Gentzsch and Neves (1988). 

The development of bigger and cheaper memory modules is being driven by the 
substantial commercial interest in data storage and manipulation. For CFD 
applications it is important that the complete program, both instructions and 
variable storage, should reside in main memory. This is because the speed of data 
transfer from secondary (disc) storage to main memory is much slower than data 
transfer rates between the main memory and the processing units. In the past the 
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main memory size has typically limited the complexity of the CFD problems under 
investigation. 

The chronological trend of increasing memory capacity for supercomputers is 
impressive. Тһе CDC-7600 (1970 technology) had a capacity of 4х 10° 64-bit 
words. The CYBER-205 (1980 technology) has a capacity of 3 x 107 64-bit words 
and the CRAY-2 (1990 technology) has a capacity of 10° 64-bit words. 

Significant developments in minicomputers in the 1970s and microcomputers 
in the 1980s have provided many alternative paths to cost-effective CFD. The 
relative cheapness of random access memory implies that large problems can be 
handled efficiently on micro- and minicomputers. The primary difference between 
microcomputers and mainframes is the significantly slower cycle time of a micro- 
computer and the simpler, less efficient architecture. However the blurring of the 
distinction between microcomputers and personal workstations, such as the SUN 
Sparcstation, and the appearance of minisupercomputers has produced a 
price/performance continuum (Gentzsch and Neves 1988). 

The coupling of many, relatively low power, parallel processors is seen as a very 
efficient way of solving complex CFD problems. Each processor can use fairly 
standard microcomputer components; hence the potentially low cost. A typical 
system, QCDPAX, is described by Hoshino (1989). This system has from 100 to 
1000 processing units, each based on the L64132 floating point processor. Thus a 
system of 400 processing units is expected to deliver about 2000 Megaflops when 
operating on a representative CFD code. 

To a certain extent the relative slowness of microcomputer-based systems can 
be compensated for by allowing longer running times. Although 15 mins on a 
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Бір. 1.3. Computer speed and memory requirements for CFD (after Bailey, 1986; reprinted with 
permission of Japan Society of Computational Fluid Dynamics) 
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Fig. 1.4. Surface pressure distribution оп а typical military aircraft. Surface pressure contours, 
AC, =0.02 (after Arlinger, 1986; reprinted with permission of Japan Society of Computational 
Fluid Dynamics) 





supercomputer appears to be the accepted norm (Bailey 1986) for routine design 
work, running times of a few hours on a microcomputer may well be acceptable in 
the research and development area. This has the advantage of allowing the CFD 
research worker adequate time to interpret the results and to prepare additional 
cases. 

The future trends for computer speed and memory capacity are encouraging. 
Predictions by Simon (1989) indicate that by 2000 one may expect sustained 
computer speeds up to 106 Megaflops and main memory capacities of 50000 
Megawords. This is expected to be adequate (Fig. 1.3) for predictions of steady 
viscous (turbulent) compressible flow around complete aircraft and to allow global 
design optimisation to be considered seriously. 


12 Typical Practical Problems 


Computational fluid dynamics, particularly in engineering, is still at the stage of 
development where "problems involving complex geometries can be treated with 
simple physics and those involving simple geometry can be treated with complex 
physics" (Bailey 1986). What is changing is the accepted norm for simplicity and 
complexity. Representative examples are provided below. 


1.21 Complex Geometry, Simple Physics 


The surface pressure distribution on a typical supersonic military aircraft is shown 
in Fig. 1.4. The freestream Mach number is 1.8 and the angle of attack is 8°. The 
aircraft consists of a fuselage, canopy, engine inlets, fin, main delta wing and 
forward (canard) wings. In addition control surfaces at the trailing edge of the delta 
wing are deflected upwards 10?. Approximately 19 000 grid points are required in 
each cross-section plane at each downstream location. The complexity of the 
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geometry places a considerable demand on the grid generating procedure. Arlinger 
(1986) uses an algebraic grid generation technique based on transfinite inter- 
polation (Sect. 13.3.4). 

The flow is assumed inviscid and everywhere supersonic so that an explicit 
marching scheme in the freestream direction can be employed. This is equivalent to 
the procedure described in Sect. 14.2.4. The explicit marching scheme is par- 
ticularly efficient with the complete flowfield requiring 15 minutes on a CRAY-1. 
The finite volume method (Sect. 5.2) is used to discretise the governing equations. 
Arlinger stresses that the key element in obtaining the results efficiently is the 
versatile grid generation technique. 


1.2.2 Simpler Geometry, More Complex Physics 


The limiting particle paths on the upper surface of a three-dimensional wing for 
increasing freestream Mach number, M, are shown in Fig. 1.5. The limiting 
particle paths correspond to the surface oil-flow patterns that would be obtained 
experimentally. The results shown іп Fig. 1.5 come from computations (Holst et al. 
1986) of the transonic viscous flow past a wing at 2° angle of attack, with an aspect 
ratio of 3 and a chord Reynolds number of 8 x 106. 

For these conditions a shock wave forms above the wing and interacts with the 
upper surface boundary layer causing massive separation. The region of separation 
changes and grows as М „ increases. The influence of the flow past the wingtip 
makes the separation pattern very three-dimensional The terminology, spiral 
node, etc, indicated in Кір. 1.5 is appropriate to the classification of three- 
dimensional separation (Tobak and Peake 1982). 

The solutions require a three-dimensional grid of approximately 170 000 points 
separated into four partially overlapping zones. The two zones immediately above 
and below the wing have a fine grid in the normal direction to accurately predict 
the severe velocity gradients that occur. In these two zones the thin layer Navier- 
Stokes equations (Sect. 18.1.3) are solved. These equations include viscous terms 
only associated with the normal direction. They are an example of reduced Navier- 
Stokes equations (Chap. 16). In the two zones away from the wing the flow is 
assumed inviscid and governed by the Euler equations (Sect. 11.6.1). 

The grid point solutions in all zones are solved by marching a pseudo-transient 
form (Sect. 6.4) of the governing equations in time until the solution no longer 
changes. To do this an implicit procedure is used similar to that described in 
Sect. 14.2.8. The zones are connected by locally interpolating the overlap region, 
typically two cells. Holst indicates that stable solutions are obtained even though 
severe gradients cross zonal boundaries. 

By including viscous effects the current problem incorporates significantly more 
complicated flow behaviour, and requires a more sophisticated computational 
algorithm, than the problem considered in Sect. 1.2.1. However, the shape of the 
computational domain is considerably simpler. In addition the computational grid 
is generated on a zonal basis which provides better control over the grid point 
locations. 
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Fig. 1.5a-d. Particle paths for upper wing surface flow. (а) М. 20.80 (b) М, = 0.85 (c) М. =0.90 
(d) M,,=0.95 (after Holst et al., 1986; reprinted with permission of Japan Society of Computational 
Fluid Dynamics) 


1.2.3 Simple Geometry, Complex Physics 


To illustrate this category a meteorological example is used instead of an en- 
gineering example. Figure 1.6 shows a four-day forecast (b) of the surface pressure 
compared with measurements (a). This particular weather pattern was associated 
with a severe storm on January 29, 1990 which caused substantial property damage 
in the southern part of England. The computations predict the developing weather 
pattern quite closely. 
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The governing equations (Cullen 1983) are essentially inviscid but account for 
wind, temperature, pressure, humidity, surface stresses over land and sea, heating 
effect, precipitation and other effects (Haltiner and Williams 1980). The equations 
are typically written in spherical polar coordinates parallel to the earth’s surface 
and in a normalised pressure coordinate perpendicular to the earth’s surface. 
Consequently difficulties associated with an irregular computational boundary and 
grid generation are minimal. 

Cullen (1990) indicates that the results shown in Fig. 1.6 were obtained on a 
192 x 120 x 15 grid and used a split explicit finite difference scheme to advance the 
solution in time. This permits the complete grid to be retained in main memory. 432 
time steps are used for a 43 day forecast and require 20 minutes processing time on 
a CYBER 205. 

Cullen (1983) reports that the major problem in extending accurate large-scale 
predictions beyond 3 to 4 days is obtaining initial data of sufficient quality. For 
more refined local predictions further difficulties arise in preventing boundary 
disturbances from contaminating the interior solution and in accurately repre- 
senting the severe local gradients associated with fronts. 

For global circulation modelling and particularly for long-term predictions the 
spectral method (Sect. 5.6) is well suited to spherical polar geometry. Spectral 
methods are generally more economical than finite difference or finite element 
methods for comparable accuracy, at least for global predictions. The application 
of spectral methods to weather forecasting is discussed briefly by Fletcher (1984) 
and in greater detail by Bourke et al. (1977). Chervin (1989) provides a recent 
indication of the capability of CFD for climate modelling. 

The above examples are indicative of the current status of CFD. For the future 
Bailey (1986) states that “more powerful computers with more memory capacity 
are required to solve problems involving both complex geometries and complex 
physics". The growth in human expectations will probably keep this statement 
current for a long time to come. 


1.3 Equation Structure 


A connecting feature of the categories of fluid dynamics considered in this book is 
that the fluid can be interpreted as a continuous medium. As a result the behaviour 
of the fluid can be described in terms of the velocity and thermodynamic properties 
as continuous functions of time and space. 

Application of the principles of conservation of mass, momentum and energy 
produces systems of partial differential equations (Vol. 2, Chap. 11) for the velocity 
and thermodynamic variables as functions of time and position. With boundary 
and initial conditions appropriate to the given flow and type of partial differential 
equation the mathematical description of the problem is established. 

Many flow problems involve the developing interaction between convection 
and diffusion. А simple example is indicated in Fig. 1.7, which shows the tem- 
perature distribution of fluid in a pipe at different times. It is assumed that the fluid 
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is moving to the right with constant velocity u and that the temperature is constant 
across the pipe. 
The temperature as a function of x and t is governed by the equation 


oT от T 


—— — — g — = <х< . . 
Ел Xx 2-2 0 іогх,<х<х; and :>0 (1.2) 


With a suitable nondimensionalisation, appropriate boundary and initial соп- 
ditions are 


Т(хі, )- ТО, і)-0 апа (1.3) 
T(x, 0)--сов zx, – 0.5 <х<0.5 
(1.4) 
-0, х<-0.5 and x>05. 


Equations (1.2-4) provide a mathematical description of the problem. The term 
«д?Т/дх? is the diffusion term and а is the thermal diffusivity. This term is 
responsible for the spread of the nonzero temperature both to the right and to the 


left; if х is small the spread is small. Computational techniques for dealing with 


equations containing such terms are dealt with in Chaps. 7 and 8. 

The term u дТ/дх is the convection term and is responsible for the temperature 
distribution being swept bodily to the right with the known velocity u. The 
treatment of this term and the complete transport equation (1.2) are considered in 
Chap.9. In more than one dimension convective and diffusive terms appear 
associated with each direction (Sect. 9.5). 


Since и is known, (1.2) is linear in Т. However, when solving for the velocity field 
it is necessary to consider equations with nonlinear convective terms. A prototype 
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for such a nonlinearity is given by Burgers’ equation (Sect. 10.1) 


—+u—-—a—>=0. (1.5) 


The nonlinear convective term, и ди/дх, permits very steep gradients іп и to develop 


if « is very small. Steep gradients require finer grids and the presence of the 
nonlinearity often necessitates an additional level of iteration in the computational 


algorithm. 
Some flow and heat transfer problems are governed by Laplace’s equation, 
0$ _ 02$ 
—5+—5=0. 1.6 
ax? ду? 0 (1.6) 


This is the case for a flow which is inviscid, incompressible апа irrotational. In that 
case ф is the velocity potential (Sect. 11.3). Laplace’s equation is typical of the type 
of equation that governs equilibrium or steady problems (Chap. 6). Laplace’s 
equation also has the special property of possessing simple exact solutions which 
can be added together (superposed) since it is linear. These properties are exploited 
in the techniques described in Sect. 14.1. 

For many flow problems more than one dependent variable will be involved 
and it is necessary to consider systems of equations. Thus one-dimensional un- 
steady inviscid compressible flow is governed by (Sect. 10.2) 








до, Agu) 
-- -0, 1,7 
at ax (1.74) 
9(ои) д 2 
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a £0 P= 0 , (1.76) 
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where р is the pressure and Е is the total energy рег unit volume given by 

-P 2 

E=—— +05 ọu* , (1.8) 


7—1 


and у is the ratio of specific heats. Although equations (1.7) are nonlinear the 
structure is similar to (1.5) without the diffusive terms. The broad strategy of the 
computational techniques developed for scalar equations will also be applicable to 
systems of equations. 

For flow problems where the average properties of the turbulence need to be 
included the conceptual equation structure could be written as follows 


ди ди д ди 
шы (аа) , (1.9) 
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where “о” is now a function of the dependent variable и, and S is a source term 
containing additional turbulent contributions. However, it should be made clear 
(Sects. 11.4.2 and 11.5.2) that turbulent flows are at least two-dimensional and often 
three-dimensional and that a system of equations is required to describe the flow. 


1.4 Overview of Computational Fluid Dynamics 


The total process of determining practical information about problems involving 
fluid motion can be represented schematically as in Fig. 1.8. 

The governing equations (Chap. 11) for flows of practical interest are usually so 
complicated that an exact solution is unavailable and it is necessary to seek a 
computational solution, Computational techniques replace the governing partial 
differential equations with systems of algebraic equations, so that a computer can 
be used to obtain the solution. This book will be concerned with the computational 
techniques for obtaining and solving the systems of algebraic equations. 

For local methods, like the finite difference, finite element and finite volume 
methods, the algebraic equations link together values of the dependent variables at 
adjacent grid points. For this situation it is understood that a grid of discrete points 
is distributed throughout the computational domain, in time and space. Conse- 
quently one refers to the process of converting the continuous governing equations 
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to a system of algebraic equations as discretisation (Chap. 3). For a global method, 
like the spectral method, the dependent variables are replaced with amplitudes 
associated with different frequencies, typically. 

The algebraic equations produced by discretisation could arise as follows. A 
typical finite difference representation of (1.2) would be 


Т Tj Tja Ту-1) _ Ә(Т;-1-2Т)-Т%;,) 


At 24x Ax? , (1.10) 


where x=j Ax and t=n At. 
If the solution is known at all grid points x, at time level n, (1.10) can be used to 
provide an algorithm for T7* +, ie. 


ut «лї 
гуту (и) Ta ЕМЕЛ -2Т;- Т), De (1.11) 


Repeated use of (1.11) generates the solution at all interior grid points, x,, at time 
level n+ 1. Incrementing п and substituting the values Т"?! into the right-hand 
side of (1.11) allows the discrete solution to be marched forward in time. 

For a local method, e.g. the finite difference method, the required number of 
grid points for an accurate solution typically depends on the dimensionality, the 
geometric complexity and severity of the gradients of the dependent variables. For 
the low about a complete aircraft a grid of ten million points might be required. At 
each grid point each dependent variable and certain auxiliary variables must be 
stored. For turbulent compressible three-dimensional flow this may require any- 
where between five and thirty dependent variables per grid point. For efficient- 
computation all of these variables must be stored in main memory. 

Since the governing equations for most classes of fluid dynamics are nonlinear 
the computational solution usually proceeds iteratively. That is, the solution for 
each dependent variable at each grid point is sequentially corrected using the 
discretised equations. The iterative process is often equivalent to advancing the 
solution over a small time step (Chap. 6). The number of iterations or time steps 
might vary from a few hundred to several thousand. 

The discretisation process introduces an error that can be reduced, in principle, 
by refining the grid as long as the discrete equations, e.g. (1.10), are faithful rep- 
resentations of the governing equations (Sect. 4.2). If the numerical algorithm that 
performs the iteration or advances in time is also stable (Sect. 4.3), then the 
computational solution can be made arbitrarily close to the true solution of the 
governing equations, by refining the grid, if sufficient computer resources are 
available. 

Although the solution is often sought in terms of discrete nodal values some 
methods, e.g., the finite element and spectral methods, do explicitly introduce a 
continuous representation for the computational solution. Where the underlying 
physical problem is smooth such methods often provide greater accuracy per 
unknown in the discretised equations. Such methods are discussed briefly in 
Chap. 5. 
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1.5 Further Reading 


The purpose of the present text is to provide an introduction to the computational 
techniques that are appropriate for solving flow problems. More specific infor- 
mation is available in other books, review articles, journal articles and conference 
proceedings. 

Richtmyer and Morton (1967) construct a general theoretical framework for 
analysing computational techniques relevant to fluid dynamics and discuss specific 
finite difference techniques for inviscid compressible flow. Roache (1976) examines 
viscous separated flow for both incompressible and compressible conditions but 
concentrates on finite difference techniques. More recently, Peyret and Taylor 
(1983) have considered computational techniques for the various branches of fluid 
dynamics with more emphasis on finite difference and spectral methods. Holt 
(1984) describes very powerful techniques for boundary layer flow and inviscid 
compressible flow. Book (1981) considers finite difference techniques for both 
engineering and geophysical fluid dynamics where the diffusive mechanisms are 
absent or very small. 

Thomasset (1981), Baker (1983) and Glowinski (1984) examine computational 
techniques based on the finite element method and Fletcher (1984) provides 
techniques for the finite element and spectral methods. Canuto et al. (1987) analyse 
computational techniques based on spectral methods. Haltiner and Williams 
(1980) discuss computational techniques for geophysical fluid dynamics. 

The review articles by Chapman (1975, 1979, 1981), Green (1982), Krause 
(1985), Kutler (1985) and Jameson (1989) indicate what engineering CFD is 
currently capable of and what will be possible in the future. These articles have a 
strong aeronautical leaning. A more general review is provided by Turkel (1982). 
Cullen (1983) and Chervin (1989) review the current status of meteorological CFD. 
Review papers on specific branches of computational fluid dynamics appear in 
Annual Reviews of Fluid Dynamics, in the lecture series of the von Karman 
Institute and in the monograph series of Pineridge Press. More advanced com- 
putational techniques which exploit vector and parallel computers will not be 
covered in this book. However Ortega and Voigt (1985) and Gentzsch and Neves 
(1988) provide a comprehensive survey of this area. 

Relevant journal articles appear in AIAA Journal, Journal of Computational 
Physics, International Journal of Numerical Methods in Fluids, Computer 
Methods in Applied Mechanics and Engineering, Computers and Fluids, Applied 
Mathematical Modelling, Communications in Applied Numerical Methods, The- 
oretical and Computational Fluid Dynamics, Numerical Heat Transfer, Journal of 
Applied Mechanics and Journal of Fluids Engineering. Important conferences are 
the International Conference series on Numerical Methods in Fluid Dynamics, 
International Symposium series on Computational Fluid Dynamics, the AIAA 
CFD conference series, the GAMM conference series, Finite Elements in Flow 
Problems conference series, the Numerical Methods in Laminar and Turbulent 
Flow conference series and many other specialist conferences. 


2. Partial Differential Equations 


In this chapter, procedures will be developed for classifying partial differential 
equations as elliptic, parabolic or hyperbolic. The different types of partial 
differential equations will be examined from both a mathematical and a physical 
viewpoint to indicate their key features and the flow categories for which they occur. 

The governing equations for fluid dynamics (Vol. 2, Chap. 11) are partial 
differential equations containing first and second derivatives in the spatial co- 
ordinates and first derivatives only in time. The time derivatives appear linearly but 
the spatial derivatives often appear nonlinearly. Also, except for the special case of 
potential flow, systems of governing equations occur rather than a single equation. 


2.1 Background 


For linear partial differential equations of second-order in two independent 
variables a simple classification (Garabedian 1964, p. 57) is possible. Thus for the 
partial differential equation (PDE) 

д?и au ^u Qu 


ди 
— + B——_ — — -- = . 
xd axdy Соу Рох | Pay Еб 0, (2.1) 


where A to G are constant coefficients, three categories of partial differential 
equation can be distinguished. These are 


elliptic PDE: B?—4AC<0, 
parabolic PDE: В2-4АС-0, (2.2) 
hyperbolic PDE: В2-4АС>0. 


It is apparent that the classification depends only on the highest-order derivatives in 
each independent variable. 
For two-dimensional steady compressible potential flow about a slender body 
the governing equation, similar to (11.109), is 
eh ao 


(-М%)<ғ%%2:9 (2.3) 
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Applying the criteria (2.2) indicates that (2.3) is elliptic for subsonic flow (М. < 1) 
and hyperbolic for supersonic flow (M , > 1). 

If the coefficients, A to G in (2.1), are functions of x, у, и, ди/дх ог ди/ду, (2.2) can 
still be used if A, B and C are given a local interpretation. This implies that the 
Classification of the governing equations can change in different parts of the 
computational domain. 

The governing equation for steady, compressible, potential flow, (11.103), can be 
written in two-dimensional natural coordinates as 


where s and паге parallel and perpendicular to the local streamline direction, and М 
is the local Mach number. Applying conditions (2.2) on a local basis indicates that 
(2.4) is elliptic, parabolic or hyperbolic as M<1, M=1 or M>1. A typical 
distribution of local Mach number, M, for the flow about an aerofoil or turbine 
blade, is shown in Fig. 11.15. The feature that the governing equation can change its 
type in different parts of the computational domain is one of the major complicating 
factors in computing transonic flow (Sect. 14.3). | 

The introduction of simpler flow categories (Sect. 11.2.6) may introduce a change 
in the equation type. The governing equations for two-dimensional steady, 
incompressible viscous flow, (11.82-84) without the @u/ét and 00/01 terms, are 
elliptic. However, introduction of the boundary layer approximation produces a 
parabolic system of PDEs, that is (11.60 and 61). 

For equations that can be cast in the form of (2.1) the classification of the PDE 
can be determined by inspection, using (2.2). When this is not possible, e.g. systems 
of PDEs, it is usually necessary to examine the characteristics (Sect. 2.1.3) to 
determine the correct classification. 

The different categories of PDEs can be associated, broadly, with different types 
of flow problems. Generally time-dependent problems lead to either parabolic or 
hyperbolic PDEs. Parabolic PDEs govern flows containing dissipative mechanisms, 
e.g. significant viscous stresses or thermal conduction. In this case the solution will 
be smooth and gradients wil reduce for increasing time if the boundary conditions 
are not time-dependent. If there are no dissipative mechanisms present, the solution 
will remain of constant amplitude if the PDE is linear and may even grow if the PDE 
is nonlinear. This solution is typical of flows governed by hyperbolic PDEs. Elliptic 
PDEs usually govern steady-state or equilibrium problems. However, some steady- 
state flows lead to parabolic PDEs (steady boundary layer flow) and to hyperbolic 
PDEs (steady inviscid supersonic flow). 


2.1.1 Nature of a Well-Posed Problem 


Before proceeding further with the formal classification of partial differential 
equations it is worthwhile embedding the problem formulation and algorithm 
construction in the framework of a well-posed problem. The governing equations 
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and auxiliary (initial and boundary) conditions are well-posed mathematically if the 
following three conditions are met: 


i) the solution exists, 
ii) the solution is unique, 
iii) the solution depends continuously on the auxiliary data. 


The question of existence does not usually create any difficulty. An exception 
occurs in introducing exact solutions of Laplace's equation (Sect. 11.3) where the 
solution may not exist at isolated points. Thus it does not exist at the location of the 
source, r —r, in (11.53). In practice this problem is often avoided by placing the 
source outside the computational domain, e.g. inside the body in Fig. 11.7. 

The usual cause of non-uniqueness is a failure to properly match the auxiliary 
conditions to the type of governing PDE. For the potential equation governing 
inviscid, irrotational flows, and for the boundary layer equations, the appropriate 
initial and boundary conditions are well established. For the Navier-Stokes 
equations the proper boundary conditions at a solid surface are well known but 
there is some flexibility in making the correct choice for farfield boundary 
conditions. In general an underprescription of boundary conditions leads to non- 
"uniqueness and an overprescription to unphysical solutions adjacent to the 
boundary in question. 

There are some flow problems for which multiple solutions may be expected on 
physical grounds. These problems would fail the above criteria of mathematical 
well-posedness. This situation often arises for flows undergoing transition from 
laminar to turbulent motion. However, the broad understanding of fluid dynamics 
will usually identify such classes of flows for which the computation may be 
complicated by concern about the well-posedness of the mathematical formulation. 

The third criterion above requires that a small change in the initial or boundary 
conditions should cause only a small change in the solution. The auxiliary 
conditions are often introduced approximately in a typical computational 
algorithm. Consequently if the third condition 15 not met the errors in the auxiliary 
data will propagate into the interior causing the solution to grow rapidly, 
particularly for hyperbolic PDEs. 

The above criteria are usually attributed to Hadamard (Garabedian 1964, 
p. 109). In addition we could take a simple parallel and require that for a well-posed 
computation: 


i) the computational solution exists, 
ii) the computational solution is unique, 
iii) the computational solution depends continuously on the approximate auxiliary 
data. 


The process of obtaining the computational solution can be represented 
schematically as in Fig.2.1. Here the specified data are the approximate 
implementation of the initial and boundary conditions. If boundary conditions are 
placed on derivatives of u an error will be introduced in approximating the 
boundary conditions. The computational algorithm is typically constructed from 
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Fig. 2.1. Computational procedure 


the governing PDE (Sect. 3.1) and must be stable (Sect. 4.3) in order for the above 
three conditions to be met. 

Therefore for a well-posed computation it is necessary that not only should both 
the underlying PDE and auxiliary conditions be well-posed but that the algorithm 
should be well-posed (stable) also. It is implicit here that the approximate solution 
produced by a well-posed computation will be close, in some Sense, to the exact 
solution of the well-posed problem. This question will be pursued in Sect. 4.1. 


2.1.2 Boundary and Initial Conditions 


It is clear from the discussion of well-posed problems and well-posed computations 
in Sect. 2.1.1 that the auxiliary data are, in a sense, the starting point for obtaining 
the interior solution, particularly for propagation problems. If we don’t distinguish 
between time and space as independent variables then the auxiliary data specified on 
OR, Fig. 2.2, is “extrapolated” by the computational algorithm (based on the PDE) 
to provide the solution in the interior, R. 


ns 


oR 


Fig. 2.2. Computational domain R 


Auxiliary conditions are specified in three ways: 


i) Dirichlet condition, e.g. и--/ on OR. 


ii) Neumann (derivative) condition, e.g. ди/дп=/ or du/és=g on OR, 


iii) mixed or Robin condition, e.g. du/én+ku=f, k 0 on OR. 


In auxiliary conditions ii) and iii), 0/0n denotes the outward normal derivative. 

For most flows, which require the solution of the Navier-Stokes equations in 
primitive variables (и, v, p, etc.), at least one velocity component is given on an inflow 
boundary. This provides a Dirichlet boundary condition on the velocity. For the 
velocity potential equation governing inviscid compressible flow, the condition that 
дф/дп = 0 at the body surface is a Neumann boundary condition. Mixed conditions 
are rare in fluid mechanics but occur in convective heat transfer. Computationally, 
Dirichlet auxiliary conditions can be applied exactly as long as f is analytic. 
However, errors are introduced in representing Neumann or mixed conditions 
(Sect. 7.3). 


2.1 Background 21 


2.1.3 Classification by Characteristics 


For partial differential equations that are functions of two independent variables the 
classification into elliptic, parabolic or hyperbolic type can be achieved by first 
seeking characteristic directions along which the governing equations only involve 
total differentials. 

For a single first-order PDE in two independent variables, 


A—+B—=C, (2.5) 
x 


a single real characteristic exists through every point and the characteristic direction 
is defined by (Fig. 2.5) 


ах В 


—=— . . (26 

dt А (2.6) 
Along the characteristic direction, (2.5) reduces to 

du C 4 du С (2.7) 


d A I ав 
Equation (2.5) is a hyperbolic PDE and it is possible to integrate (2.7) as ordinary 
differential equations along a grid defined by (2.6), as longas the initial data are given 
on а non-characteristic line. 

The same concept of characteristic directions can be used in conjunction with 
PDEs of second order in two independent variables, e.g. (2.1). Since it is found from 
(2.1 and 2) that only the coefficients of the highest derivatives determine the category 
of PDE, it is convenient to write (2.1) as 

^u ^u ^u 


A—+B——+C 


— -0, 2.8 
дхду aye t (28) 


where Н contains all the first derivative terms etc. in (2.1) апа А, В and С тау be 
functions of x, y. It is possible to obtain, for each point in the domain, two directions 
along which the integration of (2.8) involves only total differentials. The existence of 
these (characteristic) directions relates directly to the category of PDE. 
For ease of presentation the following notation is introduced: 
ди д?и д?и ^u 


ди 
ш-- =— Е---- S = ш--. . 
P дх' 2 ду” ox? ° дхду ' T ду? (2.9) 


A curve К is introduced in the interior of the domain on which Р, О, К, 8, Tand u 
satisfy (2.8). Along a tangent to К the differentials of P and О satisfy 


dP=Rdx+Sdy, (2.10) 
dQ=Sdx+Tdy , (2.11) 
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and (2.8) can be written as 
AR+BS+CT+H=0. (2.12) 


In (2.10 and 11), dy/dx defines the slope of the tangent to K. Using (2.10 and 11), R 
and T can be eliminated from (2.12) to give 


нее an 


If dy/dx is chosen such that 


dy \? dy 
4(%) -2(2 )+c= 0, (2.14) 
(2.13) reduces to the simpler relationship between dP/dx and dQ/dx, 
dP dQ 
-- -0. . 
(ан | +С — dx (2.15) 


Thetwo solutions to (2.14) define the characteristic directions for which (2.15) holds. 
Comparing (2.14) with (2.2) it is clear that if (2.8) is: 


i) a hyperbolic PDE, two real characteristics exist, 
ii) a parabolic PDE, one real characteristic exists, 
iil) an elliptic PDE, the characteristics are complex. 


Thus a consideration of the discriminant, B? — 44C, determines both the type of 
PDE and the nature of the characteristics. 

The classification of the partial differential equation type has been undertaken in 
Cartesian coordinates, so far. An important question is whether a coordinate 
transformation, such as will be described in Chap. 12, can alter the type of the partial 
differential equation. 

Thus new independent variables (е, п) аге introduced in place of (x, у) and it is 
assumed that the transformation, č = (х, y) and 7 = у(х, у) is known. Derivatives are 
transformed as (Sect. 12.1) 





ди ди ди 
L 2E — 2.16 
where €, = 02/0х, etc. After some manipulation, (2.8) becomes 
&u ^u д?и 
А' В' C ' Н'=0, 2.17 
22 +В'- Eon + p 4 (2.17) 
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where 
А’=А + BE, +C, 
В -2А6,ң,-- В(ёп,+&,п,) +26, , and (2.18) 
C' = Ani Bn,n,- Cri . 
The discriminant, (B')? — 4А’С’, then becomes 
(8) —44'C' = J?[B? -44C] , (2.19) 


where the Jacobian of the transform is J =, n, — č, п». Equation (2.19) gives the 
important result that the classification of the PDE is precisely the same whether it is 
determined in Cartesian coordinates from (2.8) or in (č, n) coordinates from (2.17 and 
18). Thus, introducing a coordinate transformation does not change the type of 
PDE. 

To extend the examination of characteristics beyond two independent variables 
is less useful. In m dimensions (m— 1) dimensional surfaces must be considered. 
However, an examination of the coefficients multiplying the highest-order de- 
rivatives can, in principle, furnish useful information. For example, in three 
dimensions (2.8) would be replaced by 

^u д?и ^u u д?и u 


ВИ ACIE A DI— EF 
axi дхду' oy axdz i Qyóz 





+H=0. (2.20) 


It is necessary to obtain a transformation, ё = (х, y, 2), 7 = (х, у, Z), ¢ =C(x, у, 2) such 
that all cross derivatives in (£, n, ©) coordinates disappear. This approach will fail for 
more than three independent variables, in which case it is convenient to replace 
(2.20) with 


а,„-——+Н=0, (2.21) 
жат ^дхудх 


where М is the number of independent variables and the coefficients аҙ replace А (о 
.F in (2.20). The previously mentioned transformation to remove cross derivatives is 
equivalent to finding the eigenvalues 4 of the matrix A with elements a, (see 
footnote). 

The following classification, following Chester (1971, p. 134), can be given: 


i) If any of the eigenvalues 4 is zero, (2.21) is parabolic. 

ii) If all eigenvalues are non-zero and of the same sign, (2.21) is elliptic. 

iii) If all eigenvalues are non-zeto and all but one are of the same sign, (2.21) is 
7 hyperbolic. 


For three independent variables Hellwig (1964, p.60) provides an equivalent 


Underlined bold type denotes matrix or tensor 
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classification in terms of the coefficients multiplying the derivatives in the 
transformed equations. 

In more than two independent variables useful information can often be 
determined about the behaviour of the partial differential equation by considering 
two-dimensional surfaces, ie. by choosing particular coordinate values. Thus for 
(2.20) the character of the equation can be established in the plane x = constant by 
temporarily freezing all terms involving x derivatives and treating the resulting 
equation as though it were a function of two independent variables. 


2.1.4 Systems of Equations 


А consideration of Chap. 11 indicates that the governing equations for fluid 
dynamics often form a system, rather than being a single equation. А two- 
component system of first-order PDEs, in two independent variables, could be 
written 


Qu ди др др 
п ах +В. 25 Tii +812 -Е, , (2.22) 
ди ди дь Qv 
Ang! Pn бау +В 5 = Ез . (2.23) 


Since both и and р are functions of x and у the following relationships hold: 


du= (=) dx + (2) ау, (2.24) 
дх ду 


Ov Ov 
в-(% а ГА (2.25) 


For the problem shown іп Fig. 2.3 it is assumed that the solution has already been 
determined in the region ACPDB. As before, two directions, dy/dx, through P are 


Solution to be 
calculated 


Boundary conditions 
чоцириоэ Aiopunog 


Initial conditions problem 





Fig. 2.3. Schematic representation of the 
computational domain for a propagation 
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sought along which only total differentials, du and dv, appear. For the system of 
equations (2.22, 23) this is equivalent to seeking multipliers, L, and L3, such that 


L, x (2.22)+ L, x (2.23) > т, du+ m, dv=(L,E,+ L,E;) . (2.26) 
Expansion of the terms making up (2.26) establishes the relationships 

L A +І,Аз; =m dx, L,B,,+L,B2,=m,dy , 

LjA,;*LA;—mjdx , 1,В,,+1,В,=тйу. (2.27) 


Eliminating т, апа т; and rearranging gives 
braun red РЕ | (2.28) 
(А,:4у-В,:4х) (Айу B55dx) || І, 
Since this system is homogeneous іп L; it is necessary that 
det[4dy— Bdx]=0, (2.29) 


for a nontrivial solution. For the above example, (2.29) takes the form 


ауу? 
(A11422— Аҙ14,:) (2) —(A1,B;;— А,В, + B1422 


4 
— Bz, А +(ВиВь—В›,В,;)=0. (2.30) 


Equation (2.30) has two solutions and the nature of the solutions depends on the 
discriminant 


DIS =(A,,B22—A2,By2+A22B i, — А, Во 1)? 
-4(А1,4;;- A231 412) (B, B5; — В, В,:) , (2.31) 


which also determines the classification of the system (2.22, 23). The different 
possibilities are indicated in Table 2.1. 

On characteristics the derivatives ди/дх, ди/ду, 0v/Ox апа Ov/Oy аге not defined 
uniquely. In fact, in crossing the characteristics discontinuities in the normal 
derivatives can occur whereas tangential derivatives are continuous. 

Ап example using the above classification can be developed as follows. The 
governing equations for two-dimensional compressible potential flow, (11.103), can 


Table 2.1. Classification of (2.22, 23) 


DIS Roots of (2.30) Classification of the system (2.22, 23) 
positive 2 real hyperbolic 
Zero 1 real parabolic 


negative 2 complex elliptic 
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be recast in terms of the velocity components, i.e. 


и? ди fuv\ ди fuv\ др fv? Qv 
“ түш  (wo|ou (uoo (v^ 1]29. 232 
(% БИӨРШЕЕШЕ )s 0 (232) 
апа 
Qu Qv 
_ ou O9 gy 2.33 
ду ах (2.33) 


Equations (2.32, 33) have the same structure as (2.22, 23). The evaluation of (2.31) 
gives 


DIS=4(M?-1), where M?=(u?+07)/a? , 


and indicates that the system (2.32, 33) is hyperbolic if M > 1. This is the same result 
as was found in considering the compressible potential equation (2.4), This is to be 
expected since, although the equations are different, they govern the same physical 
situation. 

The construction used to derive (2.28 and 29) can be generalised to a system of п 
first-order equations (Whitham 1974, p. 116). Equation (2.28) is replaced by 


(к) 
(4(%) -в|г®=0 , k-1...,n. (2.34) 
“А 4х 


The character of the system (Hellwig 1964, р. 70) depends on the solution of (2.29). 


i) If n real roots are obtained the system is hyperbolic. 
ii) If v real roots, 1&v €n— 1, and no complex roots are obtained, the system is 
parabolic. 
iil) If no real roots are obtained the system is elliptic. 


For large systems some roots may be complex and some тау be real; this gives а 
mixed system. The most important division is between elliptic and non-elliptic 
partial differential equations since elliptic partial differential equations preclude 
time-like behaviour. Therefore the system of equations will be assumed to be elliptic 
if any complex roots occur. 

The above classification extends to systems of second-order equations in two 
independent variables since auxiliary variables can be introduced to generate an 
even larger system of first-order equations. However, there is a risk that both 4 and 
Bare singular so that it may be necessary to consider combinations of the equations 
to avoid this degenerate behaviour (Whitham 1974, p. 115). 

For systems of more than two independent variables (2.29) can be partially 
generalised as follows. A system of first-order equations in three independent 
variables could be written 


42 + вс р, (2.35) 
дх ду 02 
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where 4 is the vector of n dependent variables. Equation (2.35) leads to the nth order 
characteristic polynomial (Chester 1971, р. 272) 


det[A44, + BA, +CA,]=0 , (2.36) 


where A,, Ay, 4, define a normal direction to a surface at (х, у, 2). Equation (2.36) 
generalises (2.29) and gives the condition that the surface is a characteristic surface. 
Clearly, for a real characteristic surface (2.36) must have real roots. If n real roots аге 
obtained the system is hyperbolic. 

It is possible to ask what the character of the partial differential equation is with 
respect to particular directions. For example setting А, = 4, = 1 and solving for 4, 
indicates that (2.35) is elliptic with respect to the y direction if any imaginary roots 
occur. Clearly each direction can be examined in turn. 

Here we provide a simple example of a system of equations based on the steady 
incompressible Navier Stokes equations in two dimensions. In nondimensional 


form these are 


u,+v,=0 , (2.37a) 
1 | 

uu, + Duy + р, р (u,,+u,,)=0 , (2.37b) 
1 

uv, +vv,+ py— Re (v,,+v,,)=0 , (2.37с) 


where и, = ди/дх, etc., Re is the Reynolds number апа и, v, р are the dependent 
variables. Equations (2.37) are reduced to a first-order system by introducing 
auxiliary variables R=v,, S=v, and T=u,. Thus (2.37) can be replaced with 


Uy =T, 
u,t+v, =0, 
—В +5 =0 
* > 2.38 
\ 8, +L -0, (2.38) 
S,/Re — T,/Re +p, 2uS—vT , 
'— R,/Re — S,/Re +p, =uR—-vS . 


The particular choices for (2.38) are made to avoid the equivalent of A and Bin (2.35) 
being singular. The character of the above set of equations can be determined by 
replacing 0/дх with А, and д/ду with A, and setting the determinant to zero, as in 
(2.36). The result is 


(1/Re)A2(A2+ 422? 20 . (2.39) 


Setting 4, = 1 indicates that A, is imaginary. Setting 4,— 1 indicates that imaginary 
roots exist for 4,. Therefore it is concluded that the system (2.37) is elliptic. 
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The general problem of classifying partial differential equations may be pursued 
in Garabedian (1964), Hellwig (1964), Courant and Hilbert (1962) and Chester 
(1971). 


2.1.5 Classification by Fourier Analysis 


The classification of partial differential equations by characteristics (Sects. 2.1.3 and 
2.1.4) leads to the interpretation of the roots of a characteristic polynomial, e.g. 
(2.36). The roots determine the characteristic directions (or surfaces in more than 
two independent variables). 

However, the same characteristic polynomial can be obtained from a Fourier 
analysis of the partial differential equation. In this case the roots have a different 
physical interpretation, although the classification of the partial differential 
equation in relation to the nature of the roots remains the same. The Fourier 
analysis approach is useful for systems of equations where higher than first-order 
derivatives appear, since it avoids the construction of an intermediate, but enlarged, 
first-order system. The Fourier analysis approach also indicates the expected form 
of the solution, e.g. oscillatory, exponential growth, etc. This feature is exploited in 
Chap. 16 in determining whether stable computational solutions of reduced forms of 
the Navier-Stokes equations can be obtained in a single spatial march. 

Suppose a solution of the homogeneous second-order scalar equation 


A—+B——+C—,=0 (2.40) 
y 
is sought of the form 


щ»у= У X tuexplive,);xlexplilo, 1 (241) 
ј= – ю k=- œ . 

The amplitudes of the various modes are determined by the boundary conditions. 

However, the nature of the solution will depend on the (¢,); and (¢,), coefficients, 

which may be complex. If A, Band C are not functions of u the relationship between 

c, and a, is the same for all modes so that only one mode need be considered in 

(2.41). Substituting into (2.40) gives 


—Ael—Bo,c,—Coi-0 ог 
A(o,/0,)? + В(о,/в,)+С=0. (2.42) 


This is a characteristic polynomial for с, /о, equivalent to (2.29). The nature of the 
partial differential equation (2.40) depends on the nature of the roots, and hence on 
A, B and C as indicated by (2.2). 

The Fourier analysis approach produces the same characteristic polynomial 
from the principal part of the governing equation as does the characteristic analysis. 
However, if, is assumed real, the form of the solution is wavelike in the y direction. 
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Then the solution of the characteristic polynomial (2.42) formed from the complete 
equation indicates the form of the solution in the x direction. 

An examination of (2.41) indicates the similarity with the Fourier transform 
definition (Lighthill 1958, p. 8), 


$(0,,0,) = Г Г u(x, ууехр(-ів,х)ехр(-іс,уйхау, (2.43) 


or, notationally, й = Fu. 
To analyse the character of partial differential equations, use is made of the 
following results: 


ди ди 
і0,й-Е--, і0,й-Е--. 2.44 
10 Эх ісу ду (2.44) 
Thus the characteristic polynomial is obtained by taking the Fourier transform of 
the governing equation. As an example (2.40) is transformed to 


[4(0,)2 + Bio, ic,) + С(їо,)210=0 , (2.45) 


and (2.42) follows directly. The characteristic polynomial derived via the Fourier 
transform is often called the symbol of the partial differential equation. 

The Fourier transform approach to obtaining the characteristic polynomial is 
applicable if A, B or C are functions of the independent variables. If A, B or C are 
functions of the dependent variables it is necessary to freeze them at their local 
values before introducing the Fourier transform. MEE 

The application of the Fourier analysis approach to systems of equations can be 
illustrated by considering (2.37). Freezing the coefficients и and v in (2.37, с) and 
taking Fourier transforms of u, v and p produces the following homogeneous system 
of algebraic equations: 


іс, id, 0 й 
А 1 . 
(ио „+ vo,) + Re (ci + о?) 0 io, $ о, 
+ 1 2 2 . 
0 i(uo, + vay) + Re (в --ау) іс, ф 
(2.46) 
which leads to the characteristic polynomial, ае ] = 0, i.e. 
(02 + 02) [i(uo, + vo) - (1/Re) (c2 +02)]=0 . (2.47) 


However, (2.47) contains the group i(uo,+va,), which corresponds to first 
derivatives of u and v. But the character of the system (2.37) is determined by the 
principal part, which explicitly excludes all but the highest derivatives. In this case 
(2.47) coincides with (2.39) and leads to the conclusion that (2.37) is an elliptic system. 
It is clear in comparing (2.46) with (2.38) that the Fourier analysis approach avoids 
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the problem of constructing an equivalent first-order system and the possibility that 
it may be singular, 

The roots of the characteristic polynomial produced by the Fourier analysis are 
interpreted here in the same way as in the characteristic method to determine the 
partial differential equation type. An alternative classification based on the 
magnitude of the largest root of the characteristic polynomial is described by 
Gelfand and Shilov (1967). 

The Fourier analysis approach is made use of in Sect. 16.1 to determine the 
character of the solution produced by a single downstream march. In that situation 
all terms in the governing equations, not just the principal part, are retained in the 
equivalent of (2.47). 


2.2 Hyperbolic Partial Differential Equations 


The simplest example of a hyperbolic PDE is the wave equation, 


2 2 
би ди 0. (2.48) 


For initial conditions, u(x, 0)=sin zx, du/ét(x, 0) 20, and boundary conditions, 
u(0, t)=u(1, t) 20, (2.48) has the exact solution 


u(x, t) 2 sin zx cos лі. (2.49) 


The lack of attenuation is a feature of linear hyperbolic PDEs. 

The convection equation, considered in Sect. 9.1, is a linear hyperbolic PDE. 
The equations governing unsteady inviscid flow are hyperbolic, but nonlinear, as 
are the equations governing steady supersonic inviscid flow (Sect. 14.2). 


2.21 Interpretation by Characteristics 


Hyperbolic PDEs produce real characteristics. For the wave equation (2.48) the 
characteristic directions are given by dx/dt = +1. In the (x, 2) plane, the character- 
istics through a point P are shown in Fig. 2.4, 

For the system of equations (2.32, 33) there are two characteristics, given by 


2,,2 2 2 - | 
ЕСЕ ын 


Clearly the characteristics depend on the local solution and will, in general, be 
curved (Courant and Friedrichs, 1948). 

For the first-order hyperbolic PDE (2.5) a single characteristic, dt/dx = A/B, 
passes through every point (Fig. 2.5), If A and B are constant the characteristics are 
straight lines. If A and В are functions of u, x or t, they are curved. For the linear 
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Domain of 
influence 





B - characteristic t 


--<5іоре 
х; X 


Fig. 24. Characteristics for the wave equation Fig. 2.5. Characteristics for a first-order 
hyperbolic PDE, (2.5) 


convection equation considered in Sect. 9.1 the ratio B/A is just the velocity. For 
three-dimensional steady inviscid supersonic flow the characteristic surface forms a 
cone about the local flow direction as indicated in Fig. 11.14. 

For hyperbolic PDEs it is possible to use the characteristic directions to develop 
a computational grid on which the compatibility conditions, for example (2.15), 
hold. This is the strategy behind the method of characteristics, Sect. 2.5.1. For 
reasons to be discussed in Sect. 14.2.1, this method is now mainly of historic interest. 
However it is useful for determining far-field boundary conditions (Sect. 14.2.8). 


2.2.2 Interpretation on a Physical Basis 


As noted above, hyperbolic PDEs are associated with propagation problems when 
no dissipation is present. The occurrence of real characteristics, as in Fig. 2.4, implies 
that a disturbance to the solution u at P can only influence the rest of the solution in 
the domain CPD. Conversely the solution at P is influenced by disturbances in the 
domain APB only. 

In addition, if initial conditions are specified at t =0, і.е. on AB in Fig. 2.4, these 
are sufficient to determine the solution at P, uniquely. This can be demonstrated, for 
(2.48) as follows. 

New independent variables (£, п) are introduced as 


C=x+t, mgox-t, (2.51) 
$0 that (2.48) reduces to 


^u _ 
дей, | 





0, (2.52) 


which has the general solution 


ще, =f (&) + g(n) , (2.53) 
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where fand g are arbitrary twice-differentiable functions. If (2.48) is solved as a pure 
initial value problem it is appropriate to introduce the initial conditions 


u(x, 0)=S(x) , ШЫ 0)2 T(x) . (2.54) 
It can be shown (Ames 1969, p. 165) that, for t —0, 


f(x)=0.5 |569 + | г | +С, (2.55) 


g(x) = 0.5 | sto - í rtr | 4D, 
0 


where C and D are integration constants. It then follows from (2.53) that the general 
solution of (2.48) with initial conditions given by (2.54) is 


u(x, t) = 0.5 | T S(x—10)- 7 roar | . (2.56) 


ті 


In particular, if the point P has coordinates (х;, t;), the solution at Р is 


u(x;, 11) - 0.5 Ес t;)+ S(x; — tj) + 4 " roar | , (2.57) 


xiti 


i.e. the solution at P is determined uniquely by the initial conditions on AB (Fig. 2.4). 

For hyperbolic equations there is no dissipative (or smoothing) mechanism 
present. This implies that if the initial data (or boundary data) contain disconti- 
nuities they will be transmitted into the interior along characteristics, without 
attenuation of the discontinuity for linear equations. This is consistent with the 
result indicated in Sect. 2.1.3 that discontinuities in the normal derivatives can occur 
in crossing characteristics. 

It should be emphasised here that in considering the equations that govern 
supersonic inviscid flow, which are hyperbolic, the discontinuities must be small to 
be consistent with isentropic flow.: For supersonic inviscid isentropic flow the 
governing equations (2.32, 33) produce characteristic directions given by (2.50). If 
the solution is such that the characteristics run together a non-unique solution 
would result (Whitham 1974, p. 24); in practice a shock-wave occurs. However, 
there is a change in entropy across the shock-wave and this invalidates the 
assumption of isentropic flow on which (2.32 and 33) are based. Therefore the 
shock-wave forms a boundary (internal or external) of the domain in which (2.32 
and 33) are valid. 


2.2.3 Appropriate Boundary (and Initial) Conditions 


It has already been indicated (Sect. 2.2.2) that for the wave equation (2.48) the initial 
conditions (2.54) are suitable, and, depending on the extent of AB, will determine the 


2.2 Hyperbolic Partial Differential Equations 33 


solution, uniquely, in the domain APB (Fig. 2.4). It is also possible to specify 
boundary conditions (Sect. 2.1.2), for example as on CD and EF in Fig. 2.8. 

Here we reconsider the equations (2.22, 23), since these are directly applicable to 
supersonic inviscid flow (with particular choices of A,,, etc.), and ask what are 
appropriate choices ofthe auxiliary conditions so that a unique solution to (2.22 and 
23) is possible. The characteristic directions arising from the equivalent of (2.50) will 
be labelled « and f characteristics. Three cases (shown in Fig. 2.6) are considered 
initially. 

The case shown in Fig. 2.6a is equivalent to that shown in Fig. 2.4. That is, data 
for both u and v on a non-characteristic curve, AB, uniquely determine the solution 


B~- chorocteristics 


X- сһогосіегізіісз T. / ~~ 






specified 
CASE (о) 


<. characteristics 


voru 
specified 


CASE (Ы) 







x- B- characteristics 


choracteristics 


могу 
specified 


A Fig. 2.6a-c. Auxiliary data specifications for (2.22 and 23) 
CASE (c) when hyperbolic 
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Fig. 2.7. Boundary conditions for the un- 
steady interpretation of (2.22 and 23) 





specified 


up to P. For the case shown in Fig. 2.6b AB is a non-characteristic curve but AD is a 
В characteristic. For this case и or v should be given on one curve matched to v or u 
on the other. Thus both u and v are known at A. A similar situation occurs for the 
case shown in Fig. 2.6c except that both AB and AD are characteristic curves. 

Equations (2.22 and 23) may be interpreted as unsteady equations by replacing y 
with t. A consideration (Fig. 2.7) of the computational domain x20 and tz 0 
indicates that a point P close to the boundary х=0 is partly determined by 
boundary conditions on AC and partly by initial conditions on AB, assuming that 
the governing PDEs are hyperbolic. Appropriate auxiliary conditions for this case 
are u and v specified on AB and v or u specified on AC. 

These examples, Figs. 2.6 and 2.7, illustrate the general rule for hyperbolic PDEs 
that the number of auxiliary conditions is equal to the number of characteristics 
pointing into the domain (Whitham 1974, p.127). The direction along the 
characteristic needs to be chosen consistently. For time-dependent problems the 
positive direction will be in the direction of increasing time. For multidimensional 
steady hyperbolic spatial problems in primitive variables one characteristic 
(“associated” with the continuity equation) coincides with the local streamline, Thus 
through a boundary point this characteristic defines the positive direction and 
indicates the positive direction for the other characteristics through the same point. 


2.3 Parabolic Partial Differential Equations 


Parabolic PDEs occur when propagation problems include dissipative mechanisms, 
such as viscous shear or heat conduction. The classical example of a parabolic PDE 
is the diffusion or heat conduction equation 


ди Qu 


-----т. 2.58 
ôt ax? (2.58) 
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Equation (2.58) will be used to introduce different computational techniques in 
Chap. 7. 

For initial conditions u=sinzx and boundary conditions u(0, t) 2 u(1, t)=0, 
(2.58) has the exact solution 


u(x, t)=sin zx exp( —7?t) . (2.59) 


The exponential decay in time shown by (2.59) may be contrasted with the 
oscillatory solution (2.49) of the wave equation (2.48). 

“Тһе transport equation (Sects. 9.4 and 9.5) is a linear parabolic PDE, and 
Burgers’ equation, considered in Sect. 10.1, is a nonlinear parabolic PDE. How- 
ever, the Cole-Hopf transformation (Fletcher 1983) permits Burgers’ equation 
to be converted into the diffusion equation (2.58). The unsteady Navier-Stokes 
equations are parabolic. These equations are used both for unsteady problems and 
when a pseudo-transient formulation (Sect. 6.4) is introduced to solve a steady 
problem. For purely steady flow, boundary layers (Chap. 15) and shear layers are 
typically governed by parabolic PDEs, with the flow direction having a time-like 
role. Many of the reduced forms of the Navier-Stokes equations (Chap. 16) are 
governed by parabolic PDEs. 


2.3.1 Interpretation by Characteristics 


Interpretation of (2.58) as (2.8) with y=t indicates that A=1, B= C=0so that (2.58) 
is parabolic. Solution of (2.14) indicates that there is a single characteristic direction 
defined by dt/dx=0. A typical computational domain for (2.58) is indicated in 
Fig. 2.8. In contrast to the situation for hyperbolic equations, derivatives of и аге 
always continuous in crossing the t=t; line. Characteristics do not play such a 
significant role as for hyperbolic PDEs. There is no equivalent to the method of 
characteristics for parabolic PDEs. Clearly, laying out a computational grid to 
follow the local characteristics would never advance the solution in time. 


2.32 Interpretation on a Physical Basis 


Parabolic problems are typified by solutions which march forward in time but 
diffuse in space. Thus a disturbance to the solution introduced at P (in Fig. 2.8) can 


Фо, ne 
Шы 


х Fig. 2.8. Computational domain for a 
хао u(x,0) = ч2|х) ха! parabolic PDE 
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influence any part of the computational domain for tz &. However, the magnitude 
of the disturbance quickly attenuates in moving away from Р. For steady two- 
dimensional boundary layer flow (Chap. 15) the characteristics are normal to the 
flow direction and imply no upstream influence. 

The incorporation of a dissipative mechanism also implies that even if the initial 
conditions include a discontinuity, the solution in the interior will always be 
continuous. Partial differential equations in more than one spatial direction that are 
parabolic in time become elliptic in the steady state (if a steady-state solution exists). 


2.3.3 Appropriate Boundary (and Initial) Conditions 
For (2.58) it is necessary to specify Dirichlet initial conditions, e.g. 
u(x, 0)= u(x) for OSxS1. (2.60) 


Appropriate boundary conditions would be 
ди 
и(0, t)=g(t) and Эх (1, t) - h(t) . (2.61) 


For the boundaries CD and EF (Fig. 2.8) any combination of Dirichlet, Neumann or 
mixed boundary conditions (Sect. 2.1.2) 1s acceptable. However, it is desirable, in 
specifying Dirichlet boundary conditions, to ensure continuity with the initial 
conditions at C and E. Failure to do so will produce a solution with severe gradients 
adjacent to C and E, which may create difficulties for the computational algorithm. 
For systems of parabolic PDEs, initial conditions on CE and boundary conditions 
on CD and EF are necessary for all dependent variables. 


2.4 Elliptic Partial Differential Equations 


For fluid dynamics, elliptic PDEs are associated with steady-state problems. The 
simplest example of an elliptic PDE is Laplace’s equation, 


=0, (2.62) 


which governs incompressible, potential flow. For boundary conditions 
ф(х, 0) = ѕіп пх, ф(х, 1)-віп лхехр(-л), (0, y)=¢(1, у)-0, 
(2.62) has the solution 
(x, y)=sin zx exp( — лу) (2.63) 


іп the domain 05х51, OS yl. 
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The Poisson equation for the stream function, (11.88), in two-dimensional 
rotational flow is an elliptic PDE. As noted above, the steady Navier-Stokes 
equations and the steady energy equation are also elliptic. 

For second-order elliptic PDEs of the form (2.1), an important maximum 
principle exists (Garabedian 1964, p.232). Namely, both the maximum and 
minimum values of ф must occur on the boundary OR, except for the trivial case that 
ф is a constant. The maximum principle is useful in testing that computational 
solutions of elliptic PDEs are behaving properly. 


2.4.1 Interpretation by Characteristics 


For the general second-order PDE (2.1), which is known to be elliptic, ie. 4АС < B?, 
the characteristics are complex and cannot be displayed in the (real) computational 
domain. For elliptic problems in fluid dynamics, identification of characteristic 
directions serves no useful purpose. 


2.4.2 Interpretation on a Physical Basis 


The most important feature concerning elliptic PDEs is that a disturbance 
introduced at an interior point P, as in Fig. 2.9, influences all other points in the 
computational domain, although away from P the influence will be small. This 
implies that in seeking computational solutions to elliptic problems it is necessary to 
consider the global domain. In contrast, parabolic and hyperbolic PDEs can be 
solved by marching progressively from the initial conditions. Discontinuities in 
boundary conditions for elliptic PDEs are smoothed out in the interior. 


$ огдф/дп 
specified 


N 





UN 


д or дф/дп ^ 
um | | 
Fig. 2.9. Typical domain Юг an elliptic PDE 


2.4.3 Appropriate Boundary Conditions 


The ability to influence all other points in the domain from an interior point implies 
that boundary conditions are required on all boundaries (Fig. 2.9). The boundary 
conditions can be any combination of Dirichlet, Neumann or mixed (Sect. 2.1.2) 
boundary conditions. However, if a Neumann condition, дф/дп= (s), is applied on 
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all boundaries, where п is the outward normal and 5 is measured along the boundary 
contour, care must be taken that the specification is consistent with the governing 
equation. From Green’s theorem, 


$ fds=—[V?oadV . (2.64) 
дЕ Е 


Clearly, if the governing equation is the Laplace or Poisson equation, (2.64) implies 
an additional global constraint on the Neumann boundary condition specification. 
When (2.62) represents steady, incompressible, potential flow and ф is the velocity 
potential, f is just the normal velocity. Thus for steady, incompressible, potential 
flow, (2.64) coincides with the conservation of mass, (11.7). The computational 
implementation of (2.64) is discussed in Sect. 16.2.2. For systems of elliptic PDEs 
boundary conditions are required on all boundaries for all dependent variables. 

For parabolic and hyperbolic PDEs it is always possible to obtain the local 
solution immediately adjacent to a boundary by a series expansion. Attempts to do 
the same with an elliptic PDE typically produce an infinite solution, due to the fact 
that elliptic PDEs are not well-posed for the case where boundary conditions are not 
specified on a closed boundary. 


2.5 Traditional Solution Methods 


In this section we briefly describe three techniques that may be considered pre- 
computer methods, requiring only hand or primitive machine calculation. These 
methods work well for simple model problems but are less effective for the more 
complicated equations governing fluid flow. However, they are sometimes useful in 
suggesting a method of solution or obtaining an approximate or local solution. 


2.5.1 The Method of Characteristics 


This method is only applicable to hyperbolic PDEs. It is described here for a second- 
order PDE in two independent variables, which was considered previously in Sect. 
2.1.3, 


2 
——+С— =0. 
+ ay? +H (2.65) 
Solution of (2.14) will furnish two roots, 
dy 
af and dx А (2.66) 


For two adjacent points on the characteristics defined by (2.66) the compatibility 
equation (2,15) can be approximated by 
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ol- characteristics 8 - characteristics Fig. 2.10. Method of characteristics 
a 





АФ aP+CAQ+ Hay=0 | (2.67) 


It may be recalled from Sect. 2.1.3 that Р=ди/дх and О =ди/ду so that, for the 
same two adjacent points, 


Au=PAx+Qdy . (2.68) 


It will be assumed that и, Р and О are known along some non-characteristic 
boundary (Fig. 2.10). Initially both the solution and the locations for interior 
points, like d and e, are unknown. Two equations can be obtained from (2.66) to 
provide the location of d. These are 


у.—У=Ем(х.-х.) and (2.69) 
Ya — Ys = бы(ха– хь), where (2.70) 
Е,-0.5(Е,-Е,) and G,,=0.5(G,+G,) . 


Effectively, the curved lines ad and bd have been replaced by straight lines 
determined by averaging the slope at the end points. 

If x, and y, were known it would be possible to obtain P, and Q, from (2.67) in 
the form 


Aga¥sa(P4— Pa) + Coa(Qa— Qa) + Haal Ya—Ya)=0 (2.71) 

A, GC, 7 Рь)+Сы(0,—0ь)+Ны(у.- у)=0 . (2.72) 
If P, and О. were known (2.68) could be used, as follows, to obtain u,: 

u4— Ug= Ёы(х— Xa) t Qu, Ya) . - (2.73) 


In practice (2.69--73) must be solved iteratively to obtain x4, у, Pa, Qa and ил. For 
the first step of the iteration the following approximations are used: 


F,-F,, GQ-2G,, ete. (2.74) 
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Typically two or three iterations are required as long as d is not too far from a and b. 
The method progresses by marching along the grid defined by the local 
characteristics which are determined as part of the solution. The above formulation 
is described in a fluid dynamic context by Belotserkovskii and Chushkin (1965). 

The method of characteristics has been widely used in one-dimensional unsteady 
gas dynamics and for steady two-dimensional supersonic inviscid flow. However, 
the method is rather cumbersome when extended to three or four independent 
variables, or if internal shocks occur. For supersonic inviscid flow the method of 
characteristics is useful for determining the number and form of appropriate far- 
field boundary conditions. 


2.5.2 Separation of Variables 


This method is applicable to PDEs of any classification. It will be illustrated here for 
the diffusion equation: 
ди ди 
ôt ax? 
іп the domain shown іп Fig. 2.11. Тһе initial and boundary conditions are also 
shown in Fig. 2,11. The method introduces a general separable solution 


(2.75) 


u(x, t)- X (x) T(t). (2.76) 
Substitution into (2.75) gives 


d?X 


da +А4Х=0 and (2.77) 
ат 
2154-90. (2.78) 


where 4 is an arbitrary constant. Equation (2.77) has an infinite number of solutions 
of the form 


A,(x)=A, sin kx , (2.79) 


where A, =k*, к= 1,2,3 .. . and A, are constants to be determined by the boundary 
and initial conditions. Consequently (2.78) also has an infinite number of solutions 


v(o,t] zo v(TI.t)2o 





v(x.o) s f(x) п Гір. 2.11. Auxiliary conditions for (2,75) 
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of the form 
7, (t)= B,exp(—k*t) , (2.80) 


where B, are constants to be determined by the initial and boundary conditions. 
Substituting (2.79 and 80) into (2.76) implies the general solution 


u(x,t)= Y, Cysinkxexp(—k?9) . (2.81) 
кті 


Equation (2.81) satisfies the boundary conditions of the problem. The constants С, 
are obtained from satisfying the initial conditions 


У Cysinkx=u(x,0)= f(x) , ie. (2.82) 
k=1 - . 


C,- (2) ЕС sinksds . (2.83) 
0 


For this particular problem the contribution from exp(— K?t)for large k is very small 
for t>0. Therefore it is usually only necessary to retain 10 or 20 terms in the 
expansion (2.81). 

The separation of variables method relies on the availability of a coordinate 
system for which @R coincides with coordinate lines. It also implies that the 
operators in the PDE will separate. Consequently, although the method is effective 
on model problems it does not find much direct use for the rather complicated 
equations governing fluid motion, often in irregular domains. However, an 
interesting discussion of the method is provided by Gustafson (1980, pp. 115-138). 


2.5.3 Green’s Function Method 


For a PDE written in the general manner 
Lu=f , (2.84) 


a solution can be constructed, in principle, by “inverting” the operator L. The 
solution is expressed in integral form as 


u(p) = | Ср, 4) ааи, , (2.85) 


where С(р,4) is the Green's function. In general С(р, а) contains information 
equivalent to the operator L, the boundary conditions and the domain. Conse- 
quently the major difficulty in using the Green's function method is in determining 
what the Green's function should be to suit the particular problem. The subsequent 
evaluation of (2.85) is usually straightforward. 

Green's functions can be obtained for relatively simple linear equations like 
Laplace's equation and the Poisson equation. For example, a point source of unit 
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strength in three dimensions has the Green’s function 
S(p,q) - 1/4nr,, , (2.86) 


where r,, is just the distance between p and а. This formula is effectively equivalent to 
the two-dimensional velocity potential given by (11.53) with т= 1. Carrying out the 
required differentiation indicates that 


— ViS(p,q)-ó(p,a) , (2.87) 


where ó(p, q) is the Dirac delta centred at p and V? is the Laplacian evaluated at 4. А 
property of the Dirac delta function is that 


f w(q)d(p, gdV,-w(p) and б(р,ф-0 if ржа. (2.88) 
R 


In (2.88) w(q) is an arbitrary smooth function. 
А solution procedure can be established by invoking Green's second identity, 


[(uv?v -vV?u dV 4 | ий? 0и) 5-0. | (2.89) 
R жі дп On 


In the present situation, u in (2.89) is identified with the solution of the Poisson 
equation 


`У?и=-Гоп В, и-0 on ôR (2.90) 
апа 
v-G(pq)-S(pq)h-g(pq) with v=0 оп OR. (2.91) 


Consequently (2.89) reduces to 


fofdv=—[uVodv . | (2.92) 
Е Е 


The function 9 (р, q) is chosen so that V2g =0in К and G(p, а) 20 when qis оп OR. As 
a result, (2.88 and 92) give the solution 


шр) = | С(р, a) (a)4V, . (2.93) 
R 


The Green’s function method is implicit in the panel method (Sect. 14.1) and is used 
almost directly in the boundary element method (Sect. 14.1.3). 

For some elliptic PDEs it is possible to construct an equivalent variational 
principle and to use a Rayleigh-Ritz procedure (Gustafson 1980, p. 161). Although 
such a technique is standard for structural applications ofthe finite element method, 
the elliptic PDEs that occur in fluid dynamics do not usually possess an equivalent 
variational form, 


2.6 Closure 


In this chapter we have examined the classification of PDEs into hyperbolic, 
parabolic and elliptic type. All three types occur for various simplifications of the 
fluid dynamic governing equations (Chap. 11). However, systems of equations may 
also be of mixed type. Hyperbolic PDEs are usually associated with propagation 
problems without dissipation (wave-like motion remains unattenuated) and para- 
bolic PDEs are usually associated with propagation problems with dissipation. In 
fluid dynamics the dissipation usually comes from the viscous or heat conduction 
terms or eddy-viscosity type turbulence modelling. Elliptic PDEs are associated 
with steady-state problems. 

Each type of PDE requires different boundary (and initial) conditions and may 
lend themselves to particular solution techniques. For example the method of 
characteristics is ‘natural’ for hyperbolic PDEs in two independent variables. For 
the nonlinear equations governing fluid dynamics the classification of the PDE can 
change locally. Consequently boundary conditions should be chosen to suit the 
classification of the PDE adjacent to the boundary. 

The changing classification of the governing PDEs in different parts of the 
domain can be illustrated by considering supersonic viscous flow past a two- 
dimensional wing. For this example the governing equations are the Navier-Stokes 
equations which, due to the appearance of the second derivatives, are strictly elliptic 
when interpreted according to Sect. 2,1,2. However, such a classification takes no 
account of the magnitude of the relevant terms. In fact the viscous terms are only 
significant close to the surface where the streamwise viscous dissipation is an order- 
of-magnitude smaller than the cross-stream viscous dissipation; and the governing 
equations are mixed parabolic/hyperbolic. Away from the body all the viscous 
terms are small and the equation system is effectively hyperbolic, When shock 
waves occur the severe gradients away from the body cause the viscous (and heat 
conduction) terms to be significant so that the governing equations are locally 
elliptic (within the thickness of the shock-wave). This is sufficient to replace the 
discontinuous solution (in the inviscid approximation) with a severe, but continu- 
ous, gradient. 

Clearly the strict mathematical classification of the governing PDEs should be 
tempered by a knowledge of the physical processes involved to ensure that correct 
auxiliary conditions are specified and appropriate computational techniques are 
used. | 


2.7 Problems 
Background (Sect. 2.1) 


2.1 a) Transform Laplace’s equation, 02ф/0х? +02ф/0у? =0, into generalised 
coordinates £2 (х, y, у= (х, у) and show that the resulting equation is 
elliptic. 

b) Transform the wave equation, д2$ф/0Е? — 02ф/0х? =0, into generalised 
coordinates £— £(t, x), n=n(t,x) and show that the resulting equation is 
hyperbolic. 
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2.2 


2.3 
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Convert the Kortweg-de Vries equation (Jeffrey and Taniuti 1984 and (9.27)), 


ди „ди aru 
êt ôx д? 


into an equivalent system of first-order equations by introducing auxiliary 
variables р=ди/дх, etc. Deduce that the resulting system of equations is 
parabolic. 

The nondimensional equations governing steady inviscid incompressible low 
are 


Determine the type of the system of partial differential equations. 


Hyperbolic PDEs (Sect. 2.2) 


2.4 


2.5 


Show by inspection that the second-order PDE 02,/дх04-- 0 is hyperbolic. 
Consider the equivalent system 


22-0 апа 
до 
2:79 . 


Deduce that this system is hyperbolic and that the x and t axes are 
characteristics, 

Consider the modified wave equation 

Qu s,s д?и U | 

———fR-—+u=0. 2.94 
a Pit" (234) 
Show, Бу inspection that this equation is hyperbolic. Consider the related 
system of equations 


a "9 А 
Qv дм 
ow ge о 


2.6 
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Show that this system is hyperbolic and determine the characteristic directions. 
What is the connection between (2.94) and (2.95)? Does this explain. the extra 
characteristic in (2.95)? 

The governing equations for one-dimensional unsteady isentropic inviscid 
compressible flow are 


до идо оди 
-0, 
Qt ox öx 
ou ди 1 др _ 
ôt Ox Qóx ` 


"where P = ko" and а? = yp/g. Here a is the speed of sound. Show that 


this system is hyperbolic and that the characteristics are given by 
dx/dt = uta. 


Parabolic PDES (Sect. 2.3) 


2.7 


2.8 


2.9 


(а) Convert the equation 0$/0t —«0?$/0x? — 0 to an equivalent system Бу 
introducing an auxiliary variable p= 0$/0x. Show that the system is parabolic. 
(b) Analyse 09/0t — a (0^ $/0x? + 0*9/0y?) = O in a similar way and show that it 
is parabolic. 

Consider the transport equation 6u/ét -2c0u/0x — й0?ијдх? =0 with initial 
conditions и(х,0)--ехр(сх/4) and boundary conditions u(0, t) - exp( — c?t/d) 
and u(1, t) = (d/c) Qu/0x(1, t). Show that the equation is parabolic and determine 
the solution. - 
The equations governing steady incompressible boundary layer flow over а flat 
plate can be written 


— 20, 
272 

ді, ди | ди 
"Ox ду Re dy? 


Show that this system is parabolic and suggest suitable initial and boundary 
conditions for и and v. 


Elliptic PDEs (Sect. 2.4) 


2.10 Consider the equations 


Qu Ov Qu dv 

—+—=0, -----9. 2.96 
ax Әу ду Ox (290) 
Show that this system is elliptic, 

(a) directly, 

(b) by introducing the variable ф, where и--0ф/дх and р=дф/ду. 
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2.11 Show that the expressions u = x/(x* + у?), v — y/(x? + у?) are а solution of (2.96). 
2.12 Show that the equations 


ди | ди 1 би | 0u -0 
"x Р ду Ке(дх2 ду2/ ’ 
ôx ду Ве\дх? 0y?] 


form an elliptic system and that they are satisfied by the expressions 

u= — 2 [а, a4 y - k(exp[k(x — xo)] - exp[ — k(x — хо) ] } cos(ky)]/(Re D) , 
v= —2[a5rayx — k(exp[k(x — x9)] +exp[—k(x—x 9)]} sin(ky)](ReD) , - 
where 

D = [ag +a; х+а,у+азху+ (exp[k(x — xo)] - expE—k(x — xo9)]) cos(ky)] 
and a9,a,, 05, a4, k and хо are arbitrary constants. 


Traditional Methods (Sect. 2.5) 


2.13 Consider the solution of ?T/Ox? 4-0?T/0y?—0 on a unit square, with 
boundary conditions 


ТО,у-0, T(Ly)s0, Tlx,0)=T,..., Т(х,1)=0. 


Apply the separation of variables technique to obtain 
T(x,y)= У, Aysin(knx) sinh [kn(y—1)] with 
k=1 


. QTo/ kn) C- 1)*— 1] 


А, біп Ел) 


2.14 The equation 0$/0t —a0?$/0x^ = 0 is to be solved іп the domain 05х51, 
і>0 with boundary conditions $(0,t) 20, $(1, t) 2 фр and initial condition 
ф(х, 0) 2 0. Show, via the separation of variables technique, that the solution is 


æ аф. (- 1)*ехр(— k?n2at)sin(knx 
ф=фих+ Y фа(- 1) i: )ѕіп(клх) | 
k=1 


2.15 Show that the expression 








1 — wx 
exp ( _(х—у) ) 
/4лї 4t 
is the Green’s function for the heat conduction problem considered in Problem 
2.14, by showing that it satisfies (2.75) with y fixed. 
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In this chapter an examination will be made of some of the basic computational 
techniques that are required to solve flow problems. For a specific problem the 
governing equations (Chap.11) and the appropriate boundary conditions 
(Chaps. 11 and 2) will be known. Computational techniques are used to obtain an 
approximate solution of the governing equations and boundary conditions. 

For example, for three-dimensional unsteady incompressible flow, velocity and 
pressure solutions, u(x, у, 2, t), v(x, у, 2, t), w(x, у, 2, t) and p(x, у, 2, t), would be 
computed. The process of obtaining the computational solution consists of two 
stages that are shown schematically in Fig.3.1. The first stage converts the 
continuous partial differential equations and auxiliary (boundary and initial) 
conditions into a discrete system of algebraic equations. This first stage is called 
discretisation (Sect. 3.1). The process of discretisation is easily identified if the finite 
difference method is used (Sect. 3.5) but is slightly less obvious with the finite 
element, finite volume and spectral methods (Chap. 5). 


GOVERNING APPROXIMATE 
PARTIAL DISCRET- х, 5Ү5ТЕМ ОҒ EQUATION SOLUTION, 


ALGEBRAIC [Г | 
DIFF, EQS. ISAT АТО тон, 
AND В.С. ^$ SATION EQUATIONS 50 “ыж z ) 





Fig. 3.1. Overview of the computational solution procedure 


The replacement of individual differentiated terms in the governing partial 
differential equations by algebraic expressions connecting nodal values on a finite 
grid introduces an error. Choosing the algebraic expressions in a way that produces 
small errors is considered in Sect. 3.2. The achieved accuracy of representing the 
differentiated terms is examined in Sects. 3.3 and 3.4. Equally important as the 
error in representing the differentiated terms in the governing equation is the error 
in the solution. A simple finite difference program is provided in Sect. 3.5 so that the 
solution error can be examined directly. 

In discussing unsteady problems the discretisation process is often identified 
with the reduction of the governing partial differential equations to a system of 
ordinary differential equations in time. This is understandable in the sense that 
techniques for solving ordinary differential equations (Lambert 1973) are so well- 
known that further discussion may not be required. However, in applying a 
particular method, the system of ordinary differential equations must be converted 
to a corresponding system of algebraic equations to obtain the computational 
solution. 
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The second stage of the solution process (Fig, 3.1) requires an equation solver to 
provide the solution of the system of algebraic equations. This stage can also 
introduce an error but it is usually negligible compared with the error introduced in 
the discretisation stage, unless the method is unstable (Sect. 4.3). Apprepriate 
methods for solving systems of algebraic equations are discussed in Chap. 6. 
Systems of algebraic equations typically arise in solving steady flow problems. For 
unsteady flow problems the use of explicit techniques (e.g. Sect. 7.1.1) may reduce 
the equation-solving stage to no more than a one-line algorithm. 


3.1 Discretisation 


To convert the governing partial differential equation(s) to a system of algebraic 
equations (or ordinary differential equations), a number of choices are available. 
The most common are the finite difference, finite element, finite volume and spectral 
methods. 

The way the discretisation is performed also depends on whether time derivat- 
ives (in time dependent problems) or equations containing only spatial derivatives 
are being considered. In practice, time derivatives are discretised almost exclusively 
using the finite difference method. Spatial derivatives are discretised by either the 
finite difference, finite element, finite volume or spectral method, typically. 


3.1.1 Converting Derivatives to Discrete Algebraic Expressions 


The discretisation process can be illustrated by considering the equation 


oT 02Т 

===, (3.1) 

ді дх 
which governs transient heat conduction in one dimension. Т is the temperature 
and a is the thermal diffusivity. The overbar ( ) denotes the exact solution. Typical 
boundary and initial conditions to suit (3.1) are 


T(0,)=b, T(,n)-d and (3.2) 
Т(х,0)-Т.,(х), 0<х<1. (3.3) 


The most direct means of discretisation is provided Бу replacing the derivatives by 
equivalent finite difference expressions. Thus, using (3.21, 25), (3.1) can be replaced 
by 


Tt} Тр a Tj.,—2T; Т.) | 


At Ax? (3.4) 


The step sizes At, 4x and the meaning of the subscript j and superscript n are 
indicated in Fig. 3.2. In (3.4) T} is the value of Т at the (j, n)th node. 
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t Fig. 3.2. The discrete grid 


х 
0 Ах 24х (j-1)Ax (2-1) Ах 


The process of discretising (3.1) to give (3.4) implies that the problem of finding 
the exact (continuous) solution T(x, t) has been replaced with the problem of 
finding discrete values Ту, ie. the approximate solution at the (j, n)th node 
(Fig. 3.2). In turn, two related errors arise, the truncation error and the solution 
error. The truncation error introduced by the discretisation of (3.1) will be 
considered in Sects. 3.3 and 3.4. The corresponding (solution) error between the 
approximate solution and the exact solution will be examined in Sect. 4.1. 

The precise value of the approximate solution between the nodal (grid) points is 
not obvious. Intuitively the solution would be expected to vary smoothly between 
the nodal points. In principle, the solution at some point (x,, t,) that does not 
coincide with a node can be obtained by interpolating the surrounding nodal point 
solution. It will be seen (Sect. 5.3) that this interpolation process is automatically 
built into the finite element method. 

It is apparent that, whereas (3.1) is a partial differential equation, (3.4) is an 
algebraic equation. With reference to Fig. 3.2, (3.4) can be manipulated to give a 
formula (or algorithm) for the unknown value 7771 in terms of the known values 
T at the nth time level, i.e. 





«At | 
Тр" = Tj + al T1277 + Tha) . (3.5) 


To provide the complete numerical solution at time level (n+ 1), (3.5) must be 


applied for all the nodes j=2,..., J—1, assuming that Dirichlet boundary 
conditions provide the values 7771 and 7771, 


3.1.2 Spatial Derivatives 


It has already been seen how the finite difference method discretises spatial 
derivatives, e.g. 027/0х2 in (3.1) becomes (Т/_,—2Т] + T7, ,)/4x? in (3.4). 

The finite element method (Sect. 5.3) achieves discretisation by first assuming 
that the local solution for T can be interpolated. Subsequently the local solution is 
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substituted into a suitably weighted integral of the governing equation and the 
integrals evaluated. A typical result (using linear elements on a uniform mesh) 
would be 








(AxIG( T7) — Ту.) , ФАЗ тр ТУ) | (аб Tht Ths) 
At At At 


аСТ. -2T} + Th) 


Ax (3.6) 


Dividing both sides of (3.6) by Ах produces a result that is similar in structure to 
(3.4). Equation (3.6) is derived in Sect. 5.5.1. 

The spectral method (Sect.5.6) proceeds in a similar manner to the finite 
element method except that the assumed solution for Т 15 of the form 


T= У а(0ф0), (3.7) 


where а (г) are unknown coefficients to be determined as part of the solution and 
ф(х) are known functions of x (see Sect. 5.6). The final form of the discretised 
equation using the spectral method can be written 


J а; — у, руа" , (3.8) 


where р; аге known algebraic coefficients. 

Whatever method is used to perform the discretisation the subsequent solution 
of the equations, e.g. using (3.5), is obtained directly from the algebraic equations 
and is, in a sense, independent of the means of discretisation. 


3.1.3 Time Derivatives 


The replacement of 27/01 in (3.1) with the one-sided difference formula 
(T7 * ! — Tr)/ At only uses information at time-levels n and n+ 1. Because time only 
proceeds in the positive direction, information at time-levels n+ 2 and greater is not 
available. In (3.4) the finite difference representation of the spatial derivative 
02Т/2х2 has been evaluated at time-level п and provides an explicit algorithm for 
ТҮЗІ If the spatial terms were evaluated at time-level n+ 1 the following implicit 
algorithm would be obtained: 


—sT? (1-23 T?* ТТ", (3.9) 


where s=adt/dx?. Equation (3.9) can only be solved as part of a system of 
equations formed by evaluating it for all nodes j 22, ..., J—1 (See Sect. 7.2). 

If the centred difference formula (T?* ! — T7^!)/2At were used to replace 0T/6t 
in (3.1) the following explicit algorithm can be constructed for TF *! : 
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Tr = ТР + (24 /4х?) (ТЕ 2T] € Tj) (3.10) 


Тһе algorithm (3.10) is more accurate than (3.5) but more complicated since it 
involves three levels of data, n—1, n, n+1, rather than two. This particular 
algorithm is not practical since it is unstable (Sect. 7.1.2). However the use of 
centred time differencing with other equations, eg. the convection equation 
(Sect. 9.1), is stable. 

There is an alternative approach to discretising time derivatives which builds 
on the connection with ordinary differential equations. Equation (3.1) can be 
written as 


Of oir. (3.11) 


where L is the differential operator 202/0х2. After spatial discretisation, (3.11) 
becomes 


і Т. (3.12) 


where L, is an algebraic operator resulting from the spatial discretisation, Equation 
(3.12), repeated for each node, represents a system of ordinary differential equations 
in time. Consequently, in principle, any of the techniques for integrating ordinary 
differential equations (Gear 1971) can be applied to (3.12). In general the inte- 
gration can be written 


inti 


Ти =T"+ [1,Т4. (3.13) 
fa 


The Euler scheme for evaluating (3.13) is 
T?" -TP«-[LT; At, (3.14) 


which is identical with (3.5) if L, is the finite difference operator given in (3.4). 
Because of the errors associated with the spatial discretisation operator L,, there is 
usually no advantage in using a very high-order integration formula in (3.13). Some 
of the more effective algorithms in this category are considered in Sect. 7.4. 


3.2 Approximation to Derivatives 


In Sect. 3.1 typical algebraic formulae were presented to illustrate the mechanics of 
discretising derivatives like 02 Т/дх?. Here such algebraic formulae are constructed, 
first by inspection of a Taylor series expansion and secondly via a general 
procedure. In each case an estimate of the error involved in the discretisation 
process is readily available. 
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3.2.1 Taylor Series Expansion 


As the first step in developing an algorithm to compute values of T that appear in 
(3.1), the space and time derivatives of T'at the node ( j, n) are expressed in terms of 
the values of Т at nearby nodes. Taylor series expansions such as 


xs ape | (3.15) 


poy 15 | (3.16) 
J 


are used in the process. These series may be truncated after any number of terms, 
the resulting ({runcation) error being dominated by the next term in the expansion 


if Ax «1 in (3.15) or if At «1 in (3.16). Thus we may write 


oT oT 
T" -TIA С Ax? 3.17 
үн M 44 [> 2 ss | tolx ) l ) 


Тһе term О(4х?) is interpreted as meaning there exists а positive constant К, 
depending on Т, such that the difference between T at the (j+ 1, nìth node and the 
first three terms оп the right-hand side of (3.17), all evaluated at the ( j, nìth node, is 
numerically less than K Ax? for all sufficiently small 4x. Clearly the error involved 
in this approximation rapidly reduces in magnitude as the size of 4x decreases. 

А consideration of (3.17) suggests that a finite difference expression for 2Т/дх 
could be obtained directly. Thus, by rearranging (3.17), 


271" (Т, Т) eT 
ML 4 2... 3,18 
Е |- 4x 7 054х Ox? j E (3.18) 
It is apparent that using the finite difference replacement 
ôT " Ту, 1 — Ту 
— | e+ 3.19 
Е | j Ax | 6.19) 


is accurate to O(Ax). The additional terms appearing in (3.18) are referred to as the 
truncation error. Equation (3.19) is called a forward difference approximation. By 
expanding 77. , as a Taylor series about node (j, п) and rearranging, a backward 
difference approximation can be constructed: « 


oT n Т! _ Th, 
— | se. 3.2 
E | Ах 9.20) 


This, like (3.19), introduces ап error of O(4x). A geometric interpretation of (3.19 
and 20) is provided in Fig. 3.3. Equation (3.19) evaluates [д Т/ох]" jas the slope BC; 
(3.20) evaluates [07/0х]" as the slope AB. 
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Fig. 33 Finite difference representations of 2Т/дх 





Equations (3.19 and 20) have been obtained by introducing a Taylor series 
expansion in space. The Taylor series expansion in time, (3.16), can be manipulated 
to give the forward difference approximation 


отр ттт" 
I [em l (821) 
1 


which introduces an error of O(At), assuming that 4t<1 and higher-order de- 
rivatives are bounded. 


3.2.2 General Technique 


The finite difference expressions provided in Sect. 3.2.1 have been constructed by a 
simple manipulation of a single Taylor expansion. A more methodical technique 
for constructing difference approximations is to start from a general expression, e.g. 


Tl . MEN 
EB —aT* , bT* -cT5, +0(4x") , (3.22) 
j 


where a, b and c are to be determined and the term O(Ax") will indicate the 
accuracy of the resulting approximation. 
Using (3.15) we may write 


_ — = _. тр 
aT5_,+b6T%+cT},,=(at+b+c) рулевом о; | 


4 


Ax? [2T | A [TT 
---|--з - ---|-ч TEE 3.23 
tato- ÉE a+c) < EAE (3.23) 


Setting 
a+b+c=0, (-а+с)4х=1 gives 


a-c—l/Ax and Б= –2с+1/4х foranyc. 
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Choosing c so that the third term on the nght-hand side of (3.23) disappears 
produces the most accurate approximation possible with three disposable par- 
ameters. That is 


c=—a=1/24x апа b=0. 


Substitution of these values into (3.23) gives 


OTY 1, uL uo AX[OTT] 
la | "zi TIT [s | + m 


1 


Therefore the centred (or central) difference approximation to [д Т/дх]), 15 


ФТ " Tj, 1 — Т%- 1 
— | 2s 3.24 
Е | 2Ax (3-24) 


which has a truncation error of O(Ax?). Clearly the centred difference approxi- 
mation produces a higher-order truncation error than the forward (3.19) or 
back ward (3.20) difference approximations. Equation (3.24) evaluates [eT /ax}j as 
the slope AC in Fig. 3.3. 

Using a similar representation to (3.22) the following centred difference form for 
[2?Т/0х?]" can be obtained as 
ôT " Т).1-2Т7;%7), 2 
E | = 2:2 +O(Ax*) . (3.25) 
The above technique, (3.22), can be used to obtain one-sided difference formulae by 
expanding about an appropriate node. The same technique, (3.22), can also be used 
to develop multidimensional formulae or difference formulae on a non-uniform 
grid (Sect. 10.1.5). 


3.2.3 Three-point Asymmetric Formula for [07/0x]} 


The general technique for obtaining algebraic formulae for derivatives (Sect. 3.2.2) 
is used to derive the three-point one-sided representation for [0T/0x]5. The starting 
point is the following general expression, in place of (3.22), 


УШ . = = 
[ёс аут ‚+77, Охе). (3.26) 
j 


where a, b, and c are to be determined. Т", , and Т". аге expanded about j as 
Taylor series (Sect. 3.2.1). Substituting into (3.26) and rearranging gives 


АЕС T*--(bAx * c2Ax) ШІ 


дх |) j 


bAx? 222 PH 
4 4 a... (3.27) 
( 2 2 дх? |, 
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Comparing the left- and right-hand sides of (3.27) indicates that the following 
conditions must be imposed on а, b and с to obtain the smallest error: 


bAx? | c(2Ax)? _ 


at+b+c=0, БАх+с24х=1 , + 0. 








2 2 
This gives the values 
1.5 0.5 
а= – 45 , b=—, апа c= – x , 
апа 
Be -1.57-27",,-057%,, ЗЕН 
ôx |; Ах 3 | ax? ' 


which agrees with the result given in Table 3.3. This formula has a truncation error 
of О(Ах2) like the centred difference formula (3.24). 
If more terms are included in (3.26), e.g. 


атр noi - _ - 
Есали , 


extra conditions to determine the coefficients а to е are obtained from (3.27) 
extended by requiring that the coefficients multiplying higher-order derivatives are 
zero. However schemes based on higher-order discretisations often have more 
severe stability restrictions (Sect. 4.3) than those based on low-order discretisations. 
Consequently an alternative strategy is to choose some of the coefficients a to e to 
reduce the error and some to improve the stability. A similar approach is taken in 
constructing schemes to solve ordinary differential equations (Hamming 1973, 
p. 405). 


3.3 Accuracy of the Discretisation Process 


Discretisation is necessary to convert the governing differential equation into an 
equivalent system of algebraic equations that can be solved using a computer. The 
discretisation process invariably introduces an error unless the underlying exact 
solution has a very elementary analytic form. Thus the centred difference formula 
(3.24) is exact for polynomials up to quadratic, whereas the one-sided formulae 
(3.19, 20) are exact only for linear polynomials. The exactness can be inferred from 
the fact that all terms in the truncation error are zero for polynomials of sufficiently 
low order. 
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In general the error for a finite difference representation of a derivative can be 
obtained by making a Taylor series expansion about the node at which the 
derivative is being evaluated (Sect. 3.2.2), and the evaluation of the leading term in 
the remainder provides a close approximation to the error if the grid size is small. 
However, the complete evaluation of the terms in the Taylor series relies on the 
exact solution being known. 

A more direct way of comparing the accuracy of various algebraic formulae for 
derivatives is to consider a simple analytic function, like T=exp x, and to compare 
the value of the derivative obtained analytically and obtained from the discretis- 
ation formula. Table 3.1 shows such a comparison for 4Т/йх evaluated at x=1, 
with T=exp x as the analytic function; the step size 4x —0.1. Generally the three- 
point formulae, whether symmetric or asymmetric, are considerably more accurate 
than either the (two-point) forward or backward difference formulae, but consider- 
ably less accurate than the five-point symmetric formula. It is apparent (Table 3.1) 
that the leading term in the Taylor expansion (Т.Е.) gives a good estimate of the 
error, if Ax is sufficiently small. For this particular example all higher-order 
derivatives in the Taylor expansion equal exp x. For a more general problem where 
higher-order derivatives may be larger a step size of less than 0.1 may be necessary 
to ensure the error is closely approximated by the leading term in the truncation 
error. 


Table 3.1. Comparison of formulae to evaluate dT/dx at x = 1.0 


Case Algebraic formula [Z] Error Leading term 
4х А, їп Т.Е. 

Ехасі — 2.7183 — — 

3PT SYM (T;.,- T, 24x 2.7228 0.4533x 107? 04531 x 107? 
FOR DIFF (Т,,,-Т,/Ах 2.8588 0.1406x 107° 0.1359 x 107° 
BACK DIFF (T;-T, ,)/4x _ 2,5868 -01315х1079 —0.1359 x 107° 
ЗРТ ASYM (-1.57)--27;,,-0.57;, /Ах 2.7085 —0,9773х1072 -09061 x 107? 
SPT SYM (Т,-,-8Т,-,+8Т,.,-Т,+2)/12Ах 2.7183 —0.9072x1075 —0.9061 x 1075 


Typical algebraic formulae for d*T/dx? evaluated at х = 1.0 for function values 
of T=exp x are shown in Table 3.2. The function values are evaluated at intervals 
4х=0.1. As before, the three-point symmetric formula is accurate, but now the 
three-point asymmetric formula is inaccurate. As with the evaluation of the first 
derivative formulae, the leading term in the Taylor expansion provides an accurate 
estimate of the error. ` 

The algebraic formulae for the leading term in the truncation error expressions 
are shown in Tables 3.3 and 3.4. These formulae are obtained by making a Taylor 
expansion about the jth node as in Sect. 3.2.1. In Table 3.3, 7,,, 2 d? T/dx?, etc. For 
this particular example ( T—exp x), 7,,,—7,,,,, etc. Thus the magnitude of the 
error depends primarily on powers of Ах. Consequently, as Ax is reduced it is 
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Table 3.2. Comparison of formulae to evaluate 42Т/4х? at x=1.0 








T2 
Case Algebraic formula ЕН Error Leading term 
dx* J; in Т.Е. 

Exact — 2,7183 — — 
3PTSYM (T, ,-2T,4*T,. Ax? 2.7205 0.2266 x 10-2 0.2265 x 107? 
3PT ASYM (7,—27,,,+7;+2)/4%? 3.0067 0.2884 x 107° 0.2718 x 107° 
SPTSYM  (-T7,.;4167,.,—30T, 

167,4, -Tj /124x? 2.7183 —0.3023 x 1075 —0.3020 x 1075 





Table 3.3, Truncation error leading term (algebraic): dT/dx 


Case Algebraic formula Truncation error 
leading term 

3PT SYM (Т,,1-Т,-1У24х Ax*T,.,/6 

FOR DIFF (Т,,-Т)У/Ах AxT,,,/2 

BACK DIFF о = (7,—T,_,)/Ax _ — АхТ {2 

3PT ASYM (—1.57,+27,,,—0.57,, )/4х -Ах2Т,,/3 

SPT SYM (T,-1—87T,., 4 87,,, —T)4)/124x — Ах*Тхх/30 


Table 3.4. Truncation error leading term (algebraic): 42 T/dx? 


r 


Case | Algebraic formula Truncation error 
leading term 


3PT SYM (7,.,-2Т,%Т,, yj Ax? Ax?T, 712 
3PT ASYM (7,-2Т,.,%7,,)/4х - AxT,,. 
SPT SYM (—T)- i4 16T,., — 307; 

: +16Т,+..-Т,+2)/124х? -АхТ, 2/90 





expected that the truncation error, when using the five-point formula, will reduce 
far more quickly than the error when using the two-point forward or backward 
difference formulae. 

The reason for the large error associated with the three-point asymmetric 
formula shown in Table 3.2 is apparent in Table 3.4 where the leading term in the 
truncation error is seen to be only first-order accurate. 

Roughly the directly computed error, E, may be written 


Е = A(Ax}* 
and the truncation error as 


TE = B(Ax}'... , 


where k is the exponent of the grid size in the leading term of the truncation error, as 
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Fig. 3.4. Convergence results for the evaluation of Fig, 3.5. Convergence results for the valuation 
dT/dx. E=|[dT/dx)pp/[dT/dx),, — 11 of d?T/dx?. E=|[d?T/dx?],p/[d?T/dx7],,— И 


in Tables 3.3 and 3.4 for example. Therefore it is expected that the directly computed 
error will reduce with 4x in the manner shown in Tables 3.3 and 3.4. This is 
confirmed by the results shown in Figs. 3.4 and 3.5. By plotting on а log-scale, k 15 
given by the slope of the data, and corresponds to the exponent of the gridsize in the 
leading term in the truncation error. It is clear from the data plotted that the various 
cases are achieving the expected convergence rate implied by the truncation error 
leading terms in Tables 3.3 and 3.4. The convergence rate can still be estimated from 
the truncation error even when the exact solution is unknown. Thus one may infer 
from a truncation error with a fourth-order (k = 4) leading term (SPT SYM) that the 
solution error decreases at a much faster rate with grid refinement than the solution 
error corresponding to a truncation error with a second-order (k = 2) leading term 
(3PT SYM). 


3.3.1 Higher-Order vs Low-Order Formulae 


From the results presented so far it might appear that a higher-order formula оп а 
fine grid should always be used. However this is deceptive. First, the evaluation of a 
higher-order formula involves more points and hence is less economical than the 
evaluation of a low-order formula. From a practical perspective, the accuracy that 
can be achieved for a given execution time, or the computational efficiency, is more 
important than the accuracy alone; the accuracy can always be increased by 
refining the grid. Computational efficiency is considered in Sect. 4.5. ` 
Second, higher-order formulae show а relatively small accuracy advantage over 
low-order formulae for a coarse grid but demonstrate a much greater accuracy 
advantage when the grid is refined. However for a particular problem it is often the 
case that the general accuracy level required of the answers is appropriate to a 
coarse grid or that a coarse grid is necessary because of computer memory or 
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execution time limitations. The superiority of the higher-order formulae shown in 
Figs. 3.4 and 3.5 is also dependent on the smoothness of the exact solution. Inviscid 
supersonic flows can produce discontinuous solutions, associated with the presence 
of shock waves (Liepmann and Roshko 1957, p. 56), If the solution is discontinuous 
the validity of the techniques (Sects. 3.2.1 and 3.2.2) for constructing the difference 
formulae is compromised since there is no guarantee that successive terms in the 
truncation error expansion reduce in magnitude. As a result the inclusion of more 
points in the finite difference expression and the cancellation of more terms in the 
truncation error expansions implies nothing about the corresponding solution 
accuracy. This can be seen for the exact solution shown in Fig. 3.6. 


Using the three-point symmetric formula: (4Т/4х|,-|--0.5/Ах 
Using the five-point symmetric formula: [47/4х],., = —7/(124x) 


Since the exact solution is [d7/dx], . = — oo, the five-point formula is not appreci- 
ably more accurate than the three-point formula. 





[Ах le Axle Ах-э|+-Ах ^] 





Fig. 3.6. Discontinuous (exact) solution 


1.0 x 


For viscous problems at high Reynolds number (i.e. little natural dissipation) 
discontinuities cannot occur but very severe gradients do occur. If the gradient is 
severe enough and the grid coarse enough higher-order schemes are not advan- 
tageous. This can be illustrated by the function 


j-tanh[k(x—1)] . 


This is plotted for three values of k in Fig. 3.7. Clearly, there is a gradient centred at 
x=1 whose severity grows with k. The first derivative dy/dx has been evaluated at 
х=0.96 using the three-point and five-point symmetric formulae (Table 3.1) for 
decreasing 4x with k=5 and 20. 

The result is shown in Fig. 3.8. It is noticeable that the five-point formula only 
produces superior accuracy if the grid is sufficiently refined. The necessary degree of 
refinement increases as the gradient becomes sharper (increasing k). For some 
intermediate values of 4x the five-point formula produces a less accurate evalu- 
ation of the derivative. The corresponding comparison for the second derivative 
evaluation is shown in Fig. 3.9. The same general trend is apparent, namely that the 
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Fig. 3.7. Analytic function 
y=tanh[k(1 —x)] 
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higher-order formula only provides a substantial improvement when the grid is 
refined. 

When severe gradients occur, the magnitude of higher-order derivatives is 
much larger than that of low-order derivatives. Consequently, on a given grid 
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higher-order terms in the truncation error expression do not diminish at such a 
rapid rate as when the underlying exact solution is smooth. For the same reason, 
unless the grid is made very fine the magnitude of the higher derivative in the 
leading term of the truncation error may be so large for a higher-order discretis- 
ation that the overall error is comparable to that of a low-order discretisation. 

As a general comment, at least second-order discretisations should be used for 
reasons that are discussed in Sect. 9.4. The use of higher-order discretisations may 
be justified in special circumstances. 


3.4 Wave Representation 


Many fluid flow phenomena demonstrate a wave-like motion. Therefore it is 
conceptually useful to consider the exact solution as though it were broken up into 
its separate Fourier components. This raises the question of whether the discretis- 
ation process represents waves of short and long wavelength with the same 
accuracy. 


3.4.1 Significance of Grid Coarseness 


The finite difference method replaces a continuous function g(x) with a vector of 
nodal values (g;) corresponding to a vector of discrete grid points (x;). The choice of 
an appropriate grid spacing Ах is dependent on the smoothness of g(x). A poor 
choice is illustrated in Fig, 3.10a, and a reasonable choice is shown in Fig. 3.10b. To 
obtain an accurate representation of g(x) shown in Fig. 3.10a would require a much 
smaller grid spacing 4x than for g(x) shown in Fig. 3.10b. 


- 





хі x 


Fig. 3.10a, b. Discrete representation of а(х). Grid spacing too coarse (a) and reasonable (b) 


A Fourier representation of g(x) (assumed periodic) in the interval 0 S x € 27 is 


gx)= Y, ge", (3.28) 


т---со 


where і=(— 1)1/2, m is the wave number and g,, is the amplitude of the Fourier 
mode of wavelength 4—2z/m given by (Hamming 1973, p. 509) 


1 2x _ | 
$57 5- | Фде "ах. (3.29) 
9 


62 3. Preliminary Computational Techniques 


The vector of nodal values, (9;), can also be given a Fourier representation. This has 
the form 


J 
gj7 Ў g,e"4* , | (3.30) 
mai 


where the modal amplitudes g,, are given by 


J 
g,— Ax У gje "i47 . (3.31) 
ізі 
The discrete nature of the grid restricts the range of wavelengths that can be 
represented. In particular, wavelengths shorter than the cut-off wavelength А = 24х 
cannot be represented. Consequently (9;) should be interpreted as a long-wave 
approximation of g(x). Similarly, T5*!, the approximate solution obtained from 
(3.5) should be considered a long-wave approximation to the exact solution of (3.1). 
This aspect is considered further in Sect. 9.2 and is exploited in multigrid methods 
(Sect. 6.3.5). 


3.4.2 Accuracy of Representing Waves 


The accuracy of finite difference approximations, when wave-like motion is to be 
expected, may be assessed by application to progressive waves such as 


T(x, t) = 8 (em) — cos[m(x —qt)] , (3.32) 


where 1= (— 1)!/?, R denotes the real part, m is the wave number, as in (3.28), and 4 
is the propagation speed of the wave which is moving in the positive x direction. Át 
a fixed point x, the wave motion is periodic with a period P — 2z/(qm). 

At the ( j, n)-th node, the exact value of the first and second derivatives of T are 


ôT 


ao" sin[m(x, —4t,)] , (3.33) 
= 
ot = — т? соз[т(х,— 4,)] . - (3.34) 


Substitution of (3.32) into the three-point formula 


oT "Tiha T5.) 
дх |; 24x 


gives 


oT |" 1 
E 7544 Cost [m(x;— qt,)] + mAx) — cos([m(x; —qt,)]] ^ m4x]) 


- P (3.35) 


3.4 Wave Representation 63 
Thus the amplitude ratio of the first derivative representation is 


АК(1)= [27/дх]) _ (тах) 


эрт [0Т/0х] тах (3.36) 


Making use of (3.32), the central difference approximation to 0?7/0x? gives 


T5-,—2T5 T^, i A 2 
Deita - ес) cos[mí(x;— qt,)] . (3.37) 


The amplitude ratio of the second derivative representation is 


 fsin(Q.5mAx) V 


An examination of (3.36) shows that the use of the finite difference approxi- 
mation has introduced a change in the amplitude of the derivative. For long 
wavelengths, that is д> 204х, the amplitude of the first derivative is reduced by a 
factor between 0.984 and 1.000 in using the centred difference approximation. 
However, when there are less than 4 grid spacings in one wavelength (short 
wavelengths) the amplitude of the derivative is less than 0.64 of its correct value. 
For a wavelength of 4=204x, the centred difference representation of 02Т/дх? 
reduces the amplitude by 0.992. However, at a wavelength of 4 = 2 4х the amplitude 
of the second derivative is reduced Бу. 0.405. As noted in Sect. 3.4.1, long wave- 
lengths are represented more accurately than short wavelengths. 

When the forward difference approximation to [0TJOxY., (3.19), is compared 
with the exact value of the derivative, for T given by (3.32), it is found that errors are 
introduced in both phase and amplitude. The true amplitude is multiplied by the 
factor [sin(mAx/2)/(mAx/2)] and the phase is decreased by mAx/2, which is equiv- 
alent to a spatial lead of Ах/2. For the above examples the amplitude and phase 
errors disappear as Ах->0, i.e. the long wavelength limit. 


3.4.3 Accuracy of Higher-Order Formulae 


In Sect. 3.4.2 it was indicated that the accuracy of discretisation could be assessed 
by looking at a progressive wave travelling with constant amplitude and speed, 4. 
The exact solution is given by (3.32). Here this example will be used to see if higher- 
order difference formulae represent waves more accurately than low-order for- 
mulae. Specifically, a comparison will be made of the symmetric three-point and 
five-point formulae for 2Т/дх and 0?T/0x? given in Tables 3.1 and 3.2. 

Following the same development as for (3.36) the amplitude ratio for the five- 
point symmetric representation for 2Т/дх (Table 3.1) is 





4 1 sin máx 
=|- ~- , 3. 
AR (1) ($ усовтах mA (3.39) 


The long and short wavelength behaviour of (3.39) is shown in Table 3.5. The 
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Table 3.5, Amplitude ratios for a progressive wave 


Amplitude ratio 


Amplitude ratio 


Derivative Scheme Long wavelength Short wavelength 
(A=20 Ax) (A=4Ax) 

ат ЗРТ SYM 0.9836 0.6366 

ах 5РТ SYM 0.9996 0.8488 

Derivative Scheme Long wavelength Short wavelength 
(A = 20Ax) (A =2Ax) 

ФТ ЗРТ ЗУМ 0.9918 0.4053 

4х? SPT SYM 0,9999 0.5404 


amplitude ratio for the five-point representation of 0?7/0x? (Table 3.2) is given by 


en) (3.40) 


4 
RQ) - (1-025 [cos(0.5ms) ( КЕТЕР 


The long and short wavelength behaviour of (3.40) is shown in Table 3.5. The 
results shown in Table 3.5 indicate that both schemes are more accurate for long 
wavelengths, with the five-point scheme being particularly accurate. However, 
neither scheme is very accurate for short wavelengths. This result is consistent with 
the previous comments (Sect. 3.3) about the relative lack of advantage in using a 
higher-order scheme on a coarse grid. 

For a given wave being modelled, refining the grid (i.e. allowing more points to 
represent each wavelength) shifts the problem from a short wavelength to a long 
wavelength problem. Consequently the main conclusion to be drawn from Table 
3.5 is the need to use a sufficiently refined grid to accurately compute wavelike 
motions. 


3.5 Finite Difference Method 


As implied in Sect. 3.1 the basis for the finite difference method is the construction 
of a discrete grid (Fig. 3.2), the replacement of the continuous derivatives in the 
governing partial differential equations with equivalent finite difference expressions 
and the rearrangement of the resulting algebraic equation into an algorithm, as in 
(3.5). In this section the above aspects are linked together and a simple finite 
difference program is provided. 


3.5.1 Conceptual Implementation 


The various steps in applying the finite difference method to a given problem are 
represented schematically in Fig. 3.11. 
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Fig. 3.11. Schematic of the finite difference solution process 
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The procedure can be made specific by applying it to the following transient 
heat conduction (diffusion) problem. An insulated rod (Fig. 3.12) initially has a 
temperature of T(x, 0) =0°С. At t=0 hot reservoirs (Т = 100°C) are brought into 
contact with the two ends, A and B. The problem is to find (numerically) the 
subsequent temperature T(x, t) of any point in the rod. 

The governing equation for this problem is (3.1) 

oT 971. 

& ax? 
For the solution of this equation, the initial conditions (3.3) serve to initialise the 
dependent variables Т) (Fig. 3.2). One of the simplest finite difference analogues of 
the diffusion equation is obtained by replacing the time derivative with a forward 
difference approximation and the spatial derivative with a centred difference 
approximation (as in Sect. 3.1.1). This leads, via (3.5), to the forward time, centred 
space (FTCS) scheme 


0. 


Tj'!zsT*5 1-(1-28)7;--57!,, (3.41) 


where s = a4At/Ax?. Equation (3.41) is applied at all internal nodes j =2,..., J— 1. 
In a typical heat conduction problem the boundary values Т"*! and Т"*! are 
given by the boundary conditions (3.2). The solution process is repeated, advancing 
іп time (п=1,2,...), until the required final time is reached. 
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3.5.2 DIFF: Transient Heat Conduction (Diffusion) Problem 


In Sect. 3.5.1 the implementation of the finite difference method is described 
qualitatively; here a corresponding computer program, DIFF, is provided. DIFF 
applies the FTCS scheme, (3.41), to the transient heat conduction (diffusion) 
problem shown in Fig. 3.12. 

The listing of program DIFF is shown in Fig. 3.13 and the various parameters 
used by DIFF are described in Table 3.6. 


1 
2 с SOLVES 10 TRANSIENT HEAT CONDUCTION EQUATION USING 
3c FTCS SCHEME 

4c 

5 DIMENSION TN(41),DUM(41),TD(41),X(41), TE(41) 

вс 

7< INPUT DATA: 

8c 

9c JMAX = THE NUMBER OF POINTS ALONG THE ROD 

10 C MAXEX - THE NUMBER OF TERMS IN TEE EXACT SOLUTION 
11€ ИМАХ = THE MAXIMUM NUMBER OF TIME STEPS 

12 € ALPH = THE THERMAL DIFFUSIVITY 

13 С 5 = ALPH*DELT/DELZ/DELZ 

14 C TMAX = THE MAXIMUM TIME 

15 € 

16 OPEN (1,FILEs'DIFF.DAT!) 

17 OPEN (6,FILEs^DIFF.OUT') 

18 READ(1,1) JMAX , МАХЕХ , ИМАХ, ALPH, S, TMAX 

19 1 FORMAT(3IS,E10,3,F5.2,F5.0) 

20 PI = 3.1415927 

21€ 

22 C TD = DIMENSIONAL TEMPERATURE 

23 C ТИ = NONDIMENSIONAL TEMPERATURE 

24 C 

25 ЈМАР = МА - 1 

26 AJM = ОМАР 

27 DELX = 1./AJM 

28 DELT = DELX*DELX*S/ALPH 

29 WRITE(6,2) МАХ ‚МАХЕХ, ИМАХ, TMAX 

30 2 FORMAT(^ JMAXS^,I5,' MAXEX-',I5,' ИМАХ=°,15,' ТИАХ-!,Е8.2) 
31 WRITE(6,3)S,ALPH, DELT, DELI ' 

32 3 FORMAT(’ Ss',F5.3,' ALPH z/,E10.3," DELT z',E10.3, 
33 1: DELI z',E10,3,//) 

34 WRITE(6,4)S 

35 4 FORMAT(’ FTCS(EXPLICIT) SCHEME,',5I,' S z',F5.3,//) 
36 C 

37 C SET INITIAL CONDITIONS 

38 C 

39 DQ 5 J * 1,JMAP 

40 5 TH(J) = 0. 

41 reo 

42 T=0 

43 SJ = 1,0 - 2.05 
44 C 

45 C SET BOUNDARY CONDITIONS 
46 C EACH TIME STEP STARTS AT STATEMENT 6 
47 С 
48 6 TH(1) = 1. 
49 ТИ(ЈМАХ) = 1. 
50 IF(T .LT. 0.01)TH(1) = 0.5 
51 IF(T ИТ. 0.01)TE(JMAX) = 0.5 
52 TD(1) = 100.*ТИ(1) 


53 TD(JMAX) = 100.*Ти (МАХ) Fig. 3.13. Listing of program DIFF 


3.5 Finite Difference Method 67 


54 C 
55 С COMPUTE F.D. SOLUTION 

56 С 

57 DO 7 J = 2, МАР 

58 DUM(J) = SJ«TE(J) + S*(TNE(J-1) + ТИ(Ј+1)) 
59 Т CONTINUE 

60 208 J = 2,JMAP 

61 8 TK(J) = DUM(J) 

62 С 

63 DO9 J = 2,JMAP 

64 9 TD(J) = 100, *TH(J) 

65 T = T + DELT 

66 WRITE(6, 10)T, (TD(J) ,Jz1, JMAX) 

67 10 FORMAT(? T= ',F5.0," TD-',11F6.2) 

68 C 

69 C IF MAXIMUM TIME OR MAXIMUM NUMBER OF TIME~STEPS EXCEEDED 
70 C EXIT FROM LOOP 

71 С 

72 ІР(И .GE. ИМАХ)СОТО 11 

73 IF(T .LT. ТМАХ)СОТО 6 

74 C 

75 C OBTAIN EXACT SOLUTION AND COMPARE 

76 C 

77 11 SUM = 0, 

78 DO 13 J = 1,JMAX 

79 АЈ = J-1 

80 X(J) = DELI*AJ 

81 TE(J) = 100, 

82 DO 12 M = 1, МАХЕХ 

83 AM = М 

84 DAM = {2.»АИ - 1.) 

85- DIM = DAM*PI*%(J) 

86 DTM = -ALPHSDAMSDAM*PI*PIST 

87 C 

88 С LIMIT THE ARGUMENT SIZE OF EXP(DTM) 

89 C 

90 IF(DTM .LT. ~ 87.)DTM = -87,0 

91 12 TE(J) = TE(J) - 400. /DAM/PI*SIN(DIM)*EXP(DTM) 7 
92 SUM = SUM + (TE(J) - Т0(Ј)) 92 

93 13 CONTINUE 

94 WRITE (6,14)T, (TE(J), 2=1, JMAX) 

95 14 FORMAT(/,? T= ',F5,.0,' TE=’,11F6,2,//) 
96 С 

97 C RMS IS THE RMS ERROR 

98 C 

99 AVS = SUM/(1. + AJM) 

109 RMS = SQRT(AVS) 

101 WRITE(6,15)RMS 
102 15 FORMAT(’ RMS DIF = ',E11.4,//) 

103 STOP 

104 END Fig. 3.13. (cont) Listing of program DIFF 


Program DIFF reads in the various control parameters (line 16), sets Ax and At 
and writes out the various parameters. The solution is computed at each time step 
n4 1 (Fig. 3.11) until either NMAX or TMAX (lines 72, 73) are exceeded. Then the 
exact solution is computed (lines 74-91) from 


MAXEX 400 . БЕРЕКЕНІ 
ТЕ(х,, І,)-100- > [сез аот tex Je adm “”. (3.42) 


This solution is obtained by the separation of variables approach (Sect. 2.5.2). 
Simultaneously the rms error is obtained from 
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Table 3.6. Parameters used in program DIFF 





Parameter Description 
JMAX Number of points along the rod 
MAXEX Number of terms in the exact equation 
NMAX Maximum number of time steps 
ALPH Thermal diffusivity « 
5 aAt/Ax? 
TIMAX Maximum time 
TD Dimensional temperature array 
TN Nondimensional temperature array 
TE Exact (dimensional) temperature array 
DX Ах 
DT At 
TIM Time 
X Location, 0<x<1.0 
RMS rms error in the solution 
JMAX 1/2 
RMS= ( У т-ту) / max | . (3.43) 
j=1 


For the initial conditions T(x, 0)=0 and boundary conditions T(0, )= Т(1, t) 
= 100 the computer output generated by DIFF, corresponding to s=0.5, is given 
in Fig. 3.14 and plotted in Fig. 3.15. 

The value of s is used in DIFF to set the time step At; the value of 4x —0.1. 
Reducing s reduces ДЕ and leads to a smaller rms error in the solution. This is 
apparent from the results shown in Table 3.7 with T(0, 0) = Т(1, 0) = 50. The result 
corresponding to 5= 0.167 is a special case which is discussed further in Sect. 4.1.2. 

А consideration of lines 50 and 51 of DIFF (Fig. 3.13) indicates that the values 
of T(0,0) and T(1,0) are averaged between their values implied by the initial 


JMAX= 11 МАХЕХ= 10 NMAX- 500 TMAX= 2999.00 
S= .500 ALPH = .100E-04 DELT = .500E+03 DELX = .100Е%00 


FTCS(EXPLICIT) SCHEME, $ 2.500 


T= 500. Тра 50.00 25.00 .00 .00 .00 .00 .00 .00 .00 25.00 50.00 
Та 1000, TD#100.00 50.00 12.50 .00 .00 .00 .00 .00 12.50 50.00100.00 
T= 1500. TD=100.00 56.25 25.00 6.25 .00 .00 .00 6,25 25.00 56.25100.00 
T= 2000. Тр=100.00 62.50 31.25 12.50 3.13 .00 3,13 12.50 31.25 62.50100.00 
T= 2500. 70=100.00 65.63 37.50 17.19 6,25 3.13 6,25 17.19 37.50 65,63100.00 
T= 3000. TD#100.00 68.75 41.41 21.88 10.16 6.25 10.16 21.88 41.41 68.75100.00 


T= 3000. ТЕ=100.00 68.33 41.53 22.49 11.68 8.25 11.68 22.49 41.53 68.33100.00 


RMS DIF = .9418E+00 


Fig. 3.14. Typical output from DIFF; s=0.5 
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100 Fig. 3.15. Solution of (3.41) with s=0.5: stable 
T ; behaviour 
50 
t» 3000 
9 0,5 х 10 


conditions and those implied by the boundary conditions. The effect of setting 
T(0, 0) = Г, 0) 20, (i.e. as given by the initial conditions) is also shown in Table 
3.7. Clearly this leads to a much larger solution error. If T(0, 0) = T(1, 0) = 100 the 
error is of a similar magnitude to that for T(0, 0) = T(1, 0) 2 0. 


Table 3.7. rms error for t= t max 


rms error rms error - 
5 Т(0, 0) = 7 (1,0)-50. Т(0,0)- 7(1,0)=0 . 
0.5 0.492 2.251 
0.3 0.187 |. 0.953 
0.167 0.00169 0.440 


3.6 Closure 


Discretisation is seen to introduce an error that сап be assessed by considering the 
truncation error, at least for the finite difference method. The truncation error is 
likely to be a progressively more accurate indicator of the solution error as the grid 
is refined (Sect. 4.1.2). 

The particular formulae used to represent derivatives can be chosen by elimin- 
ating successive terms from the truncation error (Sect. 3.2.2). However such an 
approach often needs to be modified to choose some of the coefficients so that the 
resulting algorithm is stable (Sect. 4.3). 

It is apparent from the specific examples considered in Sects. 3.3 and 3.4 that 
higher-order formulae are not likely to be significantly more accurate than low- 
order formulae if the exact solution contains discontinuities or severe gradients. In 
addition, in representing wave-like motion, short wavelengths are predicted less 
accurately than long wavelengths. 

The provision of a simple finite difference program, DIFF, permits many of the 
features of the finite difference method to be illustrated. By way of example it is 
shown that the accuracy of the finite difference solution of the diffusion equation is 
quite sensitive to the way that the discontinuity in the boundary and initial 
condition specification is handled. Computational solutions of the diffusion equa- 
tion for different schemes and boundary conditions are discussed in Chap. 7. 
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3.7 Problems 


Approximation to Derivatives (Sect. 3.2) 
31 Use the general technique (Sect. 3.2.2) to determine the coefficients a to d in 
the formula 
2 


d*T 
d WAT + PT, + CT js +аТ 3 


What is the truncation error of this formula? 
32 A three-level explicit discretisation of 0T/0t — «0? T/0x? =0 can be written 


0.5Т/71—-2Т]+15Т]'! 


4 
ЕЕ тл" атаа) 
Ax? Ax? ' 


(a) Expand each term as a Taylor series to determine the truncation error of 
the complete equation for arbitrary values of d. 

(b) Use the general technique to choose d as a function of s so that the scheme 
is fourth-order accurate in Ax. The formulae after (4.11) are useful for this 
problem. 


Accuracy of Discretisation (Sect. 3.3) 


3.3 For y=sinzx/2 obtain dy/dx at х=0.5 with Ax —0.1 using: 


dy Yjei7JYj-1 
а) — 2-4 
(a) dx 2Ax , 


dy у,і-Уу; 
(b) dx Ax * 


(c) dy „0-27 8yj-i +8У,+1—У,+2 

ах 124х 
and compare the accuracy of the results. 

3.4 Repeat Problem 3.3 with Ax = 0.05, 0.025, 0.0125 and determine whether the 
convergence with Ax is consistent with the leading term of the truncation 
error. It is recommended that a computer program is written to solve this 
problem, Double precision may be required to resolve the error associated 
with scheme (c). 

3.5 For y=sinzx/2 obtain 42у/4х? at x 20.5 with 4x —0.1, 0.05, 0.025, 0.0125 
using: 


d'y yj.1—2yjt уу+1 
x? Ax? , 
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(b) Фу . —yj;-at 16y;-, — 30у;+ 16у;. 1— Vj+2 
dx? 12Ax? ' 


апа compare the accuracy of the results and the convergence with the leading 
term in the truncation error. It is recommended that a computer program is 
written to solve this problem, using double precision if required. 


Wave Representation (Sect. 3.4) 
36 For a progressive wave 
T (x, t) 2cos[m(x — qt] 


show that the use of 27/01 ( T7* ! — Т") АЕ is equivalent to 


oT 2\. At 
$^ (5) зона] x- (i) |) , 


and that the use of 07/0: =(1.51Т"*1—2Т"-+0.5Т" ')/At is equivalent to 


oT 1\. i а 
x (онота зеп и] x - д; +2) |} 


Show that, in the limit sin(0.5mgAt)0.5mq4t and cos(0.5mqAt) 1.0, the 
above two expressions are almost equivalent. Show further that, in 
the limit 0.5mq4t —^0, the above expressions coincide with the exact 
evaluation of 2Т/ді. 


Finite Difference Method (Sect. 3.5) 


37 Modify program DIFF to use the following five-point symmetric scheme 
for interior points: 


T —Ty-2/12+4T)— ,/3- 257, 4T;, 3 Tj, 4/12 
дх? Ах? 


For j=2 use 


OTT, V/12-5T,305T,, 9 Tj 3/37 T,,,/12 


ax? Ax? 


and the equivalent formula for j2 JMAX - 1. 
(a) Obtain solutions with 4x —0.1 for s=0.3, 0.2, 0.1. 
(b) How does the solution accuracy compare with that of the FTCS scheme? 
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3. Preliminary Computational Techniques 


Modify program DIFF to introduce the following scheme, in place of FTCS 
scheme: 


0. 
At Ax? 


For the first time step use the forward time formula 07/0 ~(T"*1—T")/ At. 

(a) Obtain solutions with 4x 20.1 for s=0.5, 0.3, 0.1 and compare with the 
solutions produced by the FTCS scheme. 

(b) Is the accuracy behaviour with decreasing s what you would expect? 
[Hint: Consider the truncation error expression in Problem 3.2.] 


4, Theoretical Background 


In practice the algebraic equations that result from the discretisation process, 
Sect. 3.1, are obtained on a finite grid. It is to be expected, from the truncation 
errors given in Sects. 3.2 and 3.3, that more accurate solutions could be obtained on 
a refined grid. These aspects are considered further in Sect. 4.4. However for a given 
required solution accuracy it may be more economical to solve a higher-order finite 
difference scheme on a coarse grid than a low-order scheme on a finer grid, if the 
exact solution is sufficiently smooth. This leads to the concept of computational 
efficiency which is examined in Sect. 4.5. 

An important question concerning computational solutions is what guarantee 
can be given that the computational solution will be close to the exact solution of 
the partial differential equation(s) and under what circumstances the compu- 
tational solution will coincide with the exact solution. The second part of this 
question can be answered (superficially) by requiring that the approximate (compu- 
tational) solution should converge to the exact solution as the grid spacings At, Ax 
shrink to zero (Sect. 4.1). However, convergence is very difficult to establish directly 
so that an indirect route, as indicated in Fig. 4.1, is usually followed. The indirect 
route requires that the system of algebraic equations formed by the discretisation 
process (Sect. 3.1) should be consistent (Sect. 4.2) with the governing partial differ- 
ential equation(s). Consistency implies that the discretisation process can be 
reversed, through a Taylor series expansion, to recover the governing equation(s). 
In addition, the algorithm used to solve the algebraic equations to give the 
approximate solution, Т, must be stable (Sect. 4.3). Then the pseudo-equation 


CONSISTENCY + STABILITY = CONVERGENCE (4.1) 
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Discretisotion SYSTEM 











PARTIAL Or 
Ом у ALGEBRAIC 
(7) а Consistency EQUATIONS 













L| Stobility 






EXACT Convergence APPROXIMATE 
SOLUTION ss, Ax, M-»0 SOLUTION 





Fig. 4.1. Conceptual relationship between consistency, stability and convergence 
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is invoked to imply convergence. The conditions under which (4.1) can be made 
precise are given by the Lax equivalence theorem (Sect. 4.1.1). 

It is very difficult to obtain theoretical guidance for the behaviour of the 
solution on a grid of finite size. Most of the useful theoretical results are strictly 
only applicable in the limit that the grid size shrinks to zero. However the 
connections that are established between convergence (Sect. 4.1), consistency 
(Sect. 4.2) and stability (Sect. 4.3) are also qualitatively useful in assessing compu- 
tational solutions on a finite grid. 


4.1 Convergence 


A solution of the algebraic equations (Fig. 4.1) which approximate a given par- 
tial differential equation is said to be convergent if the approximate solution 
approaches the exact solution of the partial differential equation for each value of 
the independent variable as the grid spacing tends to zero. Thus we require 
T^ T (x; ta), аз Ах, At0. 

The difference between the exact solution of the partial differential equation 
and the exact solution of the system of algebraic equations is called the solution 
error, denoted by e^; that is 


"=T(x,,t,)—T? . (4.2) 


The exact solution of the system of algebraic equations is the approximate solution 
of the governing partial differential equation. The exact solution of the system of 
algebraic equations is obtained when no numerical errors of any sort, such as those 
due to round-off, are introduced during the computation. The magnitude of the 
error, e^, at the ( j, n)-th node typically depends on the size of the grid spacings, 4х 
and At, and on the values of the higher-order derivatives at that node, omitted from 
the finite difference approximations to the derivatives in the given differential 
equation. 

Proof that a solution to the system of algebraic equations converges to the 
solution of the partial differential equation is generally very difficult, even for the 
simplest cases. For the approximate solution to the diffusion equation, using the 
very simple FTCS algorithm (3.41), a proof of convergence for sx 4 is given by 
Noye (1984, pp. 117-119). Convergence is very difficult to show when the given 
partial differential equation is more complicated than the diffusion equation and 
the method of discretisation is less direct. 

А few flow problems possess exact solutions so that, for these cases, conver- 
gence can be inferred by obtaining computational solutions on progressively 
refined grids (Sect. 4.1.2). 


4.1.1 Lax Equivalence Theorem 


For a restricted class of problems convergence can be established via the Lax 
equivalence theorem (Richtmyer and Morton 1967, p. 45): "Given a properly posed 
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linear initial value problem and a finite difference approximation to it that satisfies 
the consistency condition, stability is the necessary and sufficient condition for 
convergence”. 

Although the theorem is expressed in terms of a finite difference approximation 
it is applicable to any discretisation procedure that leads to nodal unknowns, e.g., 
the finite element method. The Lax equivalence theorem is of great importance, 
since it is relatively easy to show stability of an algorithm and its consistency with 
the original partial differential equation, whereas it is usually very difficult to show 
convergence of its solution to that of the partial differential equation. 

Most “real” flow problems are nonlinear and are boundary or mixed initial/ 
boundary value problems so that the Lax equivalence theorem cannot always be 
applied rigorously. Consequently the Lax equivalance theorem should be inter- 
preted as providing necessary, but not always sufficient, conditions. In the form of 
(4.1) the Lax equivalence “equation” is useful for excluding inconsistent discretis- 
ations and unstable algorithms. 


4.1.2 Numerical Convergence 


For the equations that govern fluid flow, convergence is usually impossible to 
demonstrate theoretically. However, for problems that possess an exact solution, 
like the diffusion equation, it is possible to obtain numerical solutions on a 
successively refined grid and compute a solution error. Convergence implies that 
the solution error should reduce to zero as the grid spacing is shrunk to zero. 

For program DIFF (Fig. 3.13), solutions have been obtained on successively 
refined spatial grids, Ax — 0.2, 0.1, 0.05 and 0.025. The corresponding rms errors are 
shown in Table 4.1 for s=0.50 and 0.30. It is clear that the rms error reduces like 
Ax* approximately. Based on these results it would be a reasonable inference that 
refining the grid would produce a further reduction in the rms error and, in the 
limit of Ах (for fixed s) going to zero, the solution of the algebraic equations would 
converge to the exact solution. 

The establishment of numerical convergence is rather an expensive process 
since usually very fine grids are necessary. As s is kept constant in the above 
example the timestep is being reduced by a factor of four for each halving of Ах. In 
Table 4.1 the solution error is computed at t = 5000 s. This implies the finest grid 
solution at s—0.30 requires 266 time steps before the solution error is computed. 

For the diffusion equation (3.1) with zero boundary values and initial value 
T (x, 0)=sin(ax), O€ xx 1, the rms solution error [elms is plotted against grid 


Table 4.1. Solution error (rms) reduction with grid refinement 


rms error 
s-aAt/Ax? 4х=0.2 Ax=0.1 4х=0.05 4х = 0.025 
0.50 1.658 0.492 0.121 0.030 


0.30 0.590 0.187 0.048 0.012 
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logio (Ax) Fig, 4.2. Numerical convergence for the FTCS method 
-20 -1.8 “1.6 -.4 -.2 0 





spacing 4x in Fig. 4.2. The increased rate of convergence (fourth-order conver- 
gence) for s=4, compared with other values of s €1 (second-order convergence), is 
clearly seen, i.e. the convergence rate is like 4х“ for s=4, and like Ах? otherwise. As 
will be demonstrated in Sect. 4.2, the superior convergence rate for з=} is to be 
expected from a consideration of the leading term in the truncation error. Typi- 
cally, for sufficiently small grid spacings Ax, At, the solution error will reduce like 
the truncation error as Ax, At—0. 


4.2 Consistency 


The system of algebraic equations generated by the discretisation process is said to 
be consistent with the original partial differential equation if, in the limit that the 
grid spacing tends to zero, the system of algebraic equations is equivalent to the 
partial differential equation at each grid point. 

Clearly, consistency is necessary if the approximate solution is to converge to 
the solution of the partial differential equation under consideration. However, it is 
not a sufficient condition (Fig.4.1), for even though the system of algebraic 
equations might be equivalent to the partial differential equation as the grid 
spacing tends to zero, it does not follow that the solution of the system of algebraic 
equations approaches the solution of the partial differential equation. For instance, 
choosing s>4 in program DIFF causes the solution using the FTCS algorithm 
(3.41) to diverge rapidly. Thus as indicated by the Lax equivalence theorem (Sect. 
4.1.1), both consistency and stability are required for convergence. 

The mechanics of testing for consistency requires the substitution of the exact 
solution into the algebraic equations resulting from discretisation, and the expan- 
sion of all nodal values as Taylor series about a single point. For consistency the 
resulting expression should be made up of the original partial differential equation 
plus a remainder. The structure of the remainder should be such that it reduces to 
zero as the grid is refined. 


4.2 Consistency т 


In this section the ЕТС5 scheme (бесі. 4.2.1) and the fully implicit scheme 
(Sect. 4.2.2) will be analysed to see if they are consistent discretisations of the 
diffusion equation (3.1). 


4.2.1 FTCS Scheme 


The FTCS scheme (3.41) was used in Sect. 3.5.2 to obtain the solution of the one- 
dimensional diffusion equation. The substitution of T}, the exact solution of the 
diffusion equation at the (j, n)-th node, for the approximation Tj gives 


Tit =sTh_,+(1-2s) Te +51"... (4.3) 
We wish to determine how closely (4.3) corresponds to the diffusion equation (3.1) 


at the (j, n)-th node. Substitution of Taylor series expansions about the ( j, n)-th 
node for each term of (4.3) produces, on simplification, 


ôT eT 
-- n h . 
ЕЕЕ | | +Е)=0 , where (44) 
02Т eT 2 44 
Ej-osa| 77 | - - i2 ЗЕ | T O(At^, Ax^) . (4.5) 


It can be seen that the partial differential equation (4.4) differs from the diffusion 
equation (3.1) by the inclusion of the extra terms contained in E5, which is called 
the truncation error. The extra terms can be identified with the discretisation of the 
derivatives 27/01 and 02Т/дх? by (3.21) and (3.25), respectively. 

Clearly, as the grid spacings become smaller and smaller, the truncation error 
Ең tends to zero at the fixed point (x,, t,). In the limit as Ax, 4t20, the FTCS 
algorithm (3.41) is equivalent to the diffusion equation. This property is called 
consistency. 

For the FTCS algorithm the truncation error Е” is generally O(At, Ax?). 
However, since T satisfies the diffusion equation (3.1) it also satisfies 


OT д OT 9. 2. „Т 


=x 4.6 
227 dtdx? dx? ot ох (4-6) 
Therefore the truncation error may be written 
1\[ o*T 
E5—0.5a4 0-4) | +0(42?, Дх“). (4.7) 


If s=4, the first term іп the above expression vanishes and the truncation error is 
O(AC, Ax*), or, what is the same thing for fixed s, O(Ax^). In this case, the 
truncation error goes to zero faster than for any other value of s as the grid spacing 
is reduced. It may be recalled from Sect. 4.1.2 that the solution error also goes to 
zero faster for s=4 than for any other value (Fig. 4.2). 
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From the above it is clear that the same algebraic manipulation permits both 
the consistency and the likely convergence rate to be established. 


4.2.2 Fully Implicit Scheme 


Consistency will be investigated here for the fully implicit discretisation (7.20) of the 
diffusion equation (3.1). The fully implicit scheme is 


—$1Т"+1+(14+2) Г" - T^?) - T7. (4.8) 


To demonstrate consistency it is easier to start from the equivalent form of (4.8), 
applied to the exact solution, 


T?* 1. Т" _„ (Tr! —2T;* 1 + TI) 


-0. 9 
At Ax? (49) 


First, the terms 71+] and T^11 аге expanded about the (j,2+1)-th node and 
substituted into (4.9). The result is 


Т.+1— n 
UST dear e (SE у, an (26 Jua -- |- 0. 
(4.10) 


Second, the terms Т"*1, [Т,,]"*!, etc. in (4.10) are expanded about the (j, n)-th 
node. The result is 


F = At? \ 
(77405490, «(S Тыл» 
7 Ax*\ _- 
-a[T,.]7 ЧЕТ, 1+ (0542) [T.,]7 ... +( 12 сту 


+ At(T ea, "+ oss aram .)...]=0. (4.11) 


From the governing equation 


T,=0aT,,, Tas Ta, Т,-оЗТ., | (4.12) 
and s=aAt/Ax? or Ах? =а 415. Therefore (4.11) can be simplified to 


where the truncation error is 


2 
Е" j= -osais =) (> Yr — goa) Ure - ғ... (414 
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It is clear that as At tends to zero, Е? tends to zero and (4.13) coincides with 
the governing equation. Consequently (4.8) is consistent with the governing 
equation. 

In (4.14) all spatial derivatives have been converted to equivalent time de- 
rivatives. Using (4.12) it would be possible to express the truncation error in terms 
of the spatial grid size and derivatives only, as in (4.7). A comparison of (4.14) and 
(4.7) indicates that there is no choice of s that will reduce the truncation error of the 
fully implicit scheme to O(4 x*). 

It might appear from the above two examples that consistency can be taken for 
granted. However, attempts to construct algorithms that are both accurate and 
stable can sometimes generate potentially inconsistent discretisations, e.g. the 
DuFort-Frankel scheme, Sect. 7.1.2. 


4.3 Stability 


This is the tendency for any spontaneous perturbations (such as round-off error) in 
the solution of the system of algebraic equations (Figs. 3.1 and 4.1) to decay. A 
stable solution produced by the FTCS scheme with s=0.5 is shown in Fig. 3.15. A 
typical unstable result (s=0.6) is shown in Fig. 4.3. These results have been 
obtained with 4x=0.1 and the same initial and boundary conditions as used to 
generate Fig. 3.15. 


Fig. 4.3. Solution of (3.41) with s=0.6; 
unstable behaviour | 





It is clear from Fig. 4.3 that ап unphysical oscillation originates оп the line of 
symmetry and propagates to the boundaries. The amplitude of the oscillation 
grows with increasing time. 

The concept of stability is concerned with the growth, or decay, of errors 
introduced at any stage of the computation. In this context, the errors referred to 
are not those produced by incorrect logic but those which occur because the 
computer cannot give answers to an infinite number of decimal places. In practice, 
each calculation made on the computer is carried out to a finite number of 
significant figures which introduces a round-off error at every step of the compu- 
tation. Hence the computational solution to (3.41) is not Tj*', but *T5*!, the 
numerical solution of the system of algebraic equations. 
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A particular method is stable if the cumulative effect of all the round-off errors 
produced in the application of the algorithm is negligible. More specifically, 
consider the errors 

5$-1;-'T, (4.15) 


J 

introduced at grid points ( j, п), where j=2, 3,..., J—1 and п=0, 1, 2. It is usually 
not possible to determine the exact value of the numerical error £5 at the (j, n)-th 
grid point for an arbitrary distribution of errors at other grid points. However, it 
can be estimated using certain standard methods, some of which will be discussed 
in this section. In practice, the numerical solutions are typically more accurate than 
these estimates indicate, because stability analyses often assume the worst possible 
combination of individual errors. For instance, it may be assumed that all errors 
have a distribution of signs so that their total effect is additive, which is not always 
the case. 

It can be shown that, for linear algebraic equations produced by discretisation, 
the corresponding error terms satisfy the same homogeneous algebraic equations 
as the values of T. For instance, using the FTCS scheme (3.41) means that we are 
actually calculating *75* ! using *T5. ,, *Т and *15,,, so that 


*T^* 1 -s(*T^. «(01-29 (*T) &s(*T7,4) . (4.16) 


Substitution of (4.15) into (4.16), followed by application of (3.41), which applies 
since the exact solutions of the algebraic equations, 77, satisfy the FTCS algorithm, 
yields the homogeneous algebraic equation 


Cpr =5е]-1+(1—25)&7++56,ү. (4.17) 


Assuming given boundary and initial values, the initial errors, &, ј=2, 3,..., 
J — 1, and the boundary errors £1 and £5, п= 0, 1, 2, ... for this equation, will all 
be zero. Unless some (round-off) error is introduced in calculating the value of T} 
at some interior node, the resulting errors in the solution will remain zero. 

The two most common methods of stability analysis are the matrix method and 
the von Neumann method. Both methods are based on predicting whether there 
will be a growth in the error between the true solution of the numerical algorithm 
and the actually computed solution, i.e. including round-off contamination. 

Ап alternative interpretation of stability analysis is to suppose that the initial 
conditions are represented by a Fourier series. Each harmonic or mode in the 
Fourier series will grow or decay depending on the discretised equation, which 
typically furnishes a specific expression for the growth (or decay) factor for each 
mode. If a particular mode can grow without bound, the discretised equation has 
an unstable solution. This interpretation of stability (Richtmyer and Morton 1967, 
pp. 9-13) is exploited directly in the von Neumann method of stability analysis 
(Sects. 4.3.4 and 4.3.5). The unbounded growth of a particular mode is still possible 
if the discretised equations are solved exactly, 1.е. with no (round-off) errors being 
present. If (round-off) errors are introduced the same unstable nature of the 
discretised equations will cause unacceptable growth of the errors. Consequently 
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the procedures for analysing the stability of the discretised equations аге the same 
irrespective of the manifestation of the inherent stability or instability. 
43.1 Matrix Method: FTCS Scheme 


In this technique we express the set of equations governing the error propagation in 
matrix form and examine the eigenvalues of the associated matrix. Initially this 
method will be illustrated for the FTCS scheme (3.41). 

Substituting j=2, 3,..., J— 1 into (4.17) and noting that the errors at the 
boundary are zero, so that &1 —0 and €*=0 for all п, gives 


11-29454 , 

Est ost 4 (129 1+ 361 , 

НИ, _. (4.18) 
G*i-s0 40-298. . 


This may be written in matrix form as 


£'*!-468, п-0,1,..., | (4.19) 
where A is a (J 2) square matrix and 2" is a vector of length (J —2) defined by 
(1— 25) 5 ё" 
5 (1-25) s 
5 205 e _ 


PN 
II 
“к 
| 


$ (1-25) 67-1 
It can be shown that the errors č" are bounded as n increases if the eigenvalues A j of 


the matrix A are all distinct and have absolute values less than or equal to one; that 
is, if 








l4] $1, foralla. (4.20) 
Now the eigenvalues of the tridiagonal matrix 4 are, from (9.48), 
jn 
ц-1-аяа( Т). іс1,2,...,/-2. (4.21) 
The stability condition (4.20) permits only values of $ for which 
~1<1—4ssin?{—2" 151. К. 
& 45 sin (ааа (4.22) 


The right-hand side of this inequality is satisfied for all j and s, while the left-hand 
side requires 
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_ of іл <1 
ssn PEI 


which is true for all j if s < $. Тһе FTCS algorithm is therefore stable for s < 4. 
The form of (4.19) is the same if (4.18) is applied to the solution rather than the 
computational error. Consequently the matrix method can also be interpreted in 
terms of growth of the individual modes in a Fourier series representation of the 
initial conditions, 
Alteration of the nature of the boundary conditions at x=0 and/or х=1 only 
slightly modifies this method, as will be indicated in Sect. 4.3.3. 





4.3.2 Matrix Method: General Two-Level Scheme 


The matrix method will be demonstrated in this section for the general two-level 
scheme applied to the diffusion equation (7.24). That is 


Tj'1- TJ 


g CaL, TH?! а! B)L.,T7—0 , (4.23) 


хх * j 


where 1,.Т;-(Т).1-27;-7;У4х2. 

Іп (4.23) а is the thermal diffusivity and В is the parameter that controls the 
degree of implicitness (бесі, 7.2). The numerical error (4.15) is governed by ап 
equation equivalent to (4.23). This is 


— 582721 + (1+ 258) 9 -spiti = s(1—B) ё. + [172s(0 167 
+s(1— BE. (4.24) 


Equation (4.24) is appropriate to internal nodes. With Dirichlet boundary con- 
ditions no equations are required at the end-points. When (4.24) is repeated for all 


nodal points, j=2, 3,..., J—1, a matrix of equations can be written, 
Agé"*'!=Bé", where (4.25) 
1+258 -sf | 
—sB 1-25) -sf 
А= . . 
-5 1+258 -sf 
—sp 1+25В 
апа 


1—2s(1—f) 51-6) 
Я1-)  1—2s1—B) 51-0) 


51-8) 1—2s1—8) 1-8) 
51—68)  1-2s(1— f) 


4.3 Stability 83 


The algorithm (4.23) is stable if the magnitude of the eigenvalues of 4^ !B are 
bounded by unity. Because of the structure of 4 and B this is equivalent to 


(в), 
(Аа) 


Because of the symmetric tridiagonal nature of A апа В, analytic expressions are 
available for the eigenvalues (Mitchell and Griffiths 1980, p. 10): 








<10. (4.26) 


іл 
4-1 +256-2sp cos =) 





- i J^ d 
1--45 sin (27) ап 


А„=1-—25(1— f) +251 -pxos( Z) 


=1—45(1 еи 5) , 


Therefore stability requires 


1—4s(1 — f)sin? [jnx/2(J—1)] 
1 4- Asfsin? [jn/2U — 1)] 


it sin( jT Jes. ST=1 


ST= <1 








" п \_ _ 11-451 -В) 
i sin( 7 -1 , sr- 46-29 


ie. 1 — 4s(1— B) x 1-- 4s, which is satisfied апа 1—4s(1—f)z —1 —4sf so that 
2>45(1—28) or s€0.5/(1—2) if B «0.5. 

Thus if В «0.5, stability requires that sx 0.5/(1 —2f). If B20.5, (4.23) is un- 
conditionally stable. The desirable feature of unconditional stability is pursued in 
Sect. 7.2. 


51 


= ? 





4.3.3 Matrix Method: Derivative Boundary Conditions 


The matrix method can also be used when Neumann (derivative) boundary 
conditions occur. Suppose that 


ôT 
—=0, at х-0. (4.27) 
дх 

This сап be implemented Бу introducing a fictitious point Ту such that 





1 
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Applying this equation to the error distribution and combining with (4.24) gives the 
following equation centered at node 1: 


(1+25В) 61" '—2sB23* ' =[1— 25(1 — В) 161 + 2s(1 — В) 63 —2sdx . (4.28) 


Consequently when all the equations аге considered 


A&"*'=BéE"+C , where (4.29) 
1+2s8 | —2sB —2sóAx 
-5 1+258 -sÉ 0 
А- . . . ‚ C= 0 
—sp 1+258 —5В 0 
—sB 1-25 0 


1-241-)) 251—8) 
s1—8 — 1—2s1—B) 51-8) 


s1—8)  1—2s1—B) 51-08) 
1—6) — 1—2s(1—p) 


As before, stability requires that (4 ,-.5),] S 1.0. 


For the general case where explicit formulae are not available for the eigenval- 
ues of 4^ !B it is convenient to form 


g"+1=D"+4A7'C (4.30) 


where D=A~'B. 
The maximum eigenvalues of D can be found using the power method (Carna- 
han et al. 1969, p. 226). This requires evaluating 


Х"+1=рх" (4.31) 


until Х”*! coincides with X". After convergence X"'! is the eigenvector 
corresponding to the maximum eigenvalue, which is given by the maximum element 
of X"*!, It is usual to scale the maximum element of X" to unity at each 
iteration. The starting values, X?, are arbitrary. Carnahan, Luther and Wilkes 
provide a program and discuss possible difficulties with the method. 

For the generalised two-level scheme (4.23) the maximum eigenvalues of 
D=A~'B for both Dirichlet and Neumann boundary conditions at х=0 are 
shown in Table 4.2 for a range of values of В and s. These results were generated 
using the power method. For the Dirichlet boundary condition at x =0 the results 
are consistent with the requirement 


0.5 . 
5--- <0. 
51-58 if 8<0.5 
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Table 4.2. Numerical stability data for generalised two-level scheme 





Dirichlet b.c. Neumann b.c. 
at х=0 at х--0 
B 5 Max. eigenvalue Max. eigenvalue 
0.00 0.50 0.9660 0.9899 
0.51 — 1.0053 — 1.0289 
0.52 — 1.0446 ~ 1.0686 
0.30 1.20 — 0.9534 0.9753 
1.30 — 1.0161 —1.0273 
0.60 1.20 0.9221 0.9742 
1.30 0.9159 0.9721 


for stability (A,,,, € 1). It is clear that implementation of the Neumann boundary 
condition (4.27) slightly reduces the stability of the scheme for corresponding 
values of f and s. 


4.3.4 Von Neumann Method: FTCS Scheme 


The von Neumann analysis is the most commonly used method of determining 
stability criteria as it is generally the easiest to apply, the most straightforward and 
the most dependable. Unfortunately, it can only be used to establish necessary and 
sufficient conditions for the stability of linear initial value problems with constan 
coefficients. ` 

Practical problems typically involve variable coefficients, nonlinearities апа 
complicated types of boundary conditions. In which case the method can only be 
applied locally and with the nonlinearities temporarily frozen. For this more 
general situation the von Neumann method provides necessary, but not always 
sufficient, conditions for stability. Strictly it is only applicable for interior points. 
But the von Neumann method can provide heuristic information about the 
influence of derivative boundary conditions on the numerical stability if it is 
applied separately to the algorithms used at the boundaries (Trapp and Ramshaw 
1976). 

In the von Neumann method, the errors distributed along grid lines at one time 
level are expanded as a finite Fourier series. Then the stability or instability of the 
computational algorithm is determined by considering whether separate Fourier 
components of the error distribution decay or amplify in progressing to the next 
time level. Thus an initial error vector & is expressed as a finite complex Fourier 
series, so that at x, the error is 


7-2 
89-7 ае, j22,3,...,J-1, (4.32) 
=1 


where i2 (— 1)!? and 0, --тлАх. The Fourier series (4.32) has the same form as 
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(3.30) except that here the errors are assumed to be periodic in the interval of 
interest, O€ x € 1. As long as the computational algorithm is linear, it is sufficient to 
study the propagation of the error due to just a single term, ехр(10,/), of the 
Fourier series representation (4.32). Therefore, in the following, the subscript m has 
been dropped from ди. 

In view of the form of the initial error distribution, a solution of the "error" 
equation (4.17) appropriate to the FTCS scheme is sought in the separation-of- 
variables form as 


£z (Се , : (4.33) 
where the time dependence of this Fourier component of the error is contained in 


the complex coefficient (G)", and the superscript n implies С is raised to the power n. 
Substitution of (4.33) into the "error" equation (4.17) produces the equation 


(С)"* leiti — 6(б)теі80- Y4(1 _ 2s)(G)" e + s(G)' eU + 1) . 
Algebraic manipulation gives 
G —1— 4ssin?(8/2) . (4.34) 


The term G can be interpreted as the amplification factor for the mth Fourier mode 
of the error distribution as it propagates one step forward in time, since from (4.33) 


E" +1 

І--б. (4.35) 

e 
It may be noted that G is a function of s and 0. That is, G(s, 0) depends on the size 
of the grid spacing and the particular Fourier mode being considered, since 
sz-aAt/Ax? and 02 mnAx. The errors will remain bounded if the absolute value of 
G, also called the gain, is never greater than unity for all Fourier modes 0. Thus the 
general stability requirement is 


IG|s1, forall8. (4.36) 
From (4.34) the stability requirement for the FTCS scheme is 





-1<1-455іп2(0/251, forallé, (4.37) 


which is true if 55 1/2. This result is the same as found by the matrix method. 


4.3.5 Von Neumann Method: General Two-Level Scheme 


The von Neumann stability analysis will be illustrated here for the general two- 
level implicit scheme (7.24). The round-off errors associated with solving this 
scheme are propagated by the same equation, 


g-ü 


Up CBLQG'-a —p)L,,£;-0 , (4.38) 
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where 


Lag a 2t ёра 


Ax? , 
a is the thermal diffusivity and В is a parameter that controls the degree of 
implicitness. 

The von Neumann stability analysis proceeds by introducing a Fourier series 
(4.32) for the error between the true numerical solution and the numerical solution 
contaminated by round-off errors, &, defined by (4.15). Since (4.38) is linear, only а 
single Fourier component is introduced in the expression for £5, that is 


& -(Gy'e?, (4.39) 
where iz (— 1)? and j is the grid number. Substitution gives 


L,,U - (Gy'e*U- » — 2(Gye* + (Gy e*u* D 
а; 2(c088 — 1) 
= ft 210} 
(суге A . 


Consequently (4.38) becomes 


2(cos0 — 1) 


(бу + 19167 _ (Gy'eiei 
———————— — t 
Ax? 


T B(Gy* 1 е18} 


2.19; 2(cos 8 — 1) 
+ a(1 — p)(G) е} Ae 
Dividing through by (G)"e! gives 


[С — 1]/4t -aBG[2(cos 9 — 1)]/4 x? - a(1 — B) [2(cos @—1)] /4x? =0 
ог 
_ 1-45 — f)sin*(8/2) 


1 + 458 sin2(0/2) (4.40) 


For stability |G| < 1 for any choice of 6. For the limiting values 
віп(0/22-0, С=1 
sin(@/2)=1 , G=[1—4s(1-—f)]/[1 + 458] 
IfGs1, 1-—45(1-—8В)<1+45В or 5>0. 
If G-—1, 1-4s(1—8)z—1—4sB ог 8505/(1-2)). 


These restrictions agree with the result given in Sects. 4.3.2 and 7.2.2. For more 
complicated equations it may be necessary to establish stability by evaluating the 
equivalent of (4.40) for a range of values of 0, B and s, etc. This is often done 
numerically. 
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For algorithms that couple three time levels together, е.р. Sect. 7.2.3, a quad- 
ratic equation in G must be solved to obtain the stability restrictions. For most 
fluid flow problems a system of governing equations 1s required. The von Neumann 
stability analysis then leads to an amplification matrix G in place of the amplifi- 
cation factor б. The corresponding stability condition is 


14„1=1.0 for allm . (4.41) 


where 4, are the eigenvalues of б. 

In this section it is assumed that the underlying physical problem is stable. 
However it may happen that one needs to compute flows that are concerned with 
physical instability; for example transition from laminar to turbulent flow. For 
such flows a growing solution with time must be acceptable. This can be allowed 
for by requiring the magnitude of the eigenvalues in the matrix method (Sect. 4.3.1) 
and the amplification factor in the von Neumann method (Sect. 4.3.4) to be less 
than 1+ O(At). This condition replaces the inequalities (4.20, 36). This aspect is 
discussed further by Richtmyer and Morton (1967, p. 70). 

Stability of the discretised equations is discussed more fully by Mitchell and 
Griffiths (1980), Isaacson and Keller (1966) and Richtmyer and Morton (1967). It is 
generally accepted that the matrix method of determining stability is not so reliable 
as the von Neumann method. The reasons for this are discussed in relation to the 
one-dimensional transport equation (9.56) by Morton (1980) and Hindmarsh et al. 
(1984). The problem of stability of the discretised equations adjacent to boundaries 
lies beyond the scope of this book (see Sod 1985, Chap. 5). 


4.4 Solution Accuracy 


Strictly, the previous discussion of convergence, consistency and stability has been 
concerned with the behaviour of the approximate solution in the limit Ax, dt 0. 
However in practice approximate solutions are obtained on a finite grid and the 
corresponding accuracy is of considerable importance. 

The determination of the consistency, Sect. 4.2, produces an explicit expression 
for the truncation error. For an exact solution of the partial differential equation it 
is possible to evaluate the accuracy of the representation of the individual deriva- 
tive terms as in Sect.3.3. Combining the individual terms together gives an 
indication of how closely the algebraic equation agrees with the partial differential 
equation. As is to be expected from Sect. 3.3, evaluation of the leading term in the 
truncation error of the complete equation provides a close estimate of the error. 

As indicated in Sect. 4.2 the order of the truncation error will usually coincide 
with the order of the solution error if the grid spacings are sufficiently small and if 
the initial and boundary conditions are sufficiently smooth. It is common for the 
implied improvement in accuracy (from the truncation error) of higher-order 
schemes over say a second-order scheme, not to be achieved due to insufficient 
smoothness in the initial conditions for the particular grid. Refining the grid will 
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very often produce a superior accuracy for the higher-order scheme but at an 
absolute accuracy level that is far higher, and hence far more expensive, than is 
necessary. Thus an engineering accuracy of 1% in the rms error of the solution 
might be required. A fourth-order scheme may only demonstrate superior accuracy 
over a second-order scheme when the grid has been refined to the point where the 
rms error is perhaps 0.01%. An example of such an effect of non-smooth initial 
data on the accuracy is given by Fletcher (1983). 

For problems governed by hyperbolic partial differential equations, discontin- 
uities can form in the interior of the domain, and these effectively limit the accuracy 
that can be achieved, unless they are isolated from the rest of the domain and a 
local solution sought. 

One way of determining the accuracy of a particular algorithm on a finite grid is 
to apply it to a related but simplified problem, which possesses an exact solution. In 
this regard Burgers' equation (Sect. 10.1) is useful since it models the convective and 
diffusive terms in the momentum equations and since exact solutions can be 
obtained for many combinations of initial and boundary conditions. However 
accuracy is also problem dependent, and an algorithm which is accurate for a 
model problem may not necessarily be so accurate for the (more complicated) 
problem of interest. 

А second technique for assessing accuracy is to obtain solutions on successively 
refined grids (assuming the computing capacity is available) and to check that, with 
successive refinement, the solution is not changing to some predetermined accu- 
racy. This assumes that the approximate solution will converge to the exact 
solution in the limit At, Ax, etc.—0 and that the approximate solution on the finest 
grid can be used in place of the exact solution. Since this is usually impossible to 
guarantee for "real" problems, it is useful to compare the computational solutions 
with reliable experimental results (of known accuracy) for the same problem. For 
example, the pressure distribution on a body surface could be used to check that 
the refined grid solution is reasonable. However experimental data are usually not 
available in sufficient detail, for a particular problem, to permit an evaluation of a 
global rms error. 

Assuming that the accuracy of the approximate solution can be assessed it is 
important to consider the related question of how the accuracy may be improved. 
At the broadest level the answer to this question may lie in making a different 
choice for the dependent variables, e.g. vorticity and stream function instead of 
velocity and pressure. Alternatively, a different choice of independent variables 
may be appropriate. For example polar coordinates will produce more accurate 
solutions for pipe flow than Cartesian coordinates for the same number of grid 
points. 

At a more specific level the use of higher-order schemes or grid refinement 
would be expected to produce more accurate solutions. However it is not very 
meaningful to consider such choices without taking into account the execution 
time and consequently the computational efficiency (Sect. 4.5). For the moment one 
specific technique, Richardson extrapolation, for improving the accuracy will be 
considered. 
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4.4.1 Richardson Extrapolation 


It has already been shown that convergence can be demonstrated numerically 
(Sect. 4.1.2) by obtaining solutions on successively refined grids. For sufficiently fine 
grids the solution error reduces like the truncation error (Table 4.1). This last 
feature motivates Richardson extrapolation, which is used to improve the solution 
accuracy on a finite (but fine) grid. The basic idea is to add together suitably 
weighted solutions on successively refined grids to cancel the leading term in the 
truncation error. However the form of the truncation error arises from Taylor 
series expansions over the local grid spacing. It is not surprising, therefore, to find 
that Richardson extrapolation often requires a relatively fine grid to be effective. 

Consider two solutions of the diffusion equation obtained using the FTCS 
scheme with two x-steps, 4x, and 4х, but with the same value of s. A composite 
solution is constructed as 


T.=aT,+bT, , (4.42) 


where a and b are coefficients to be determined. Expansion of the FTCS scheme 
applied to T, as a Taylor series about the (j, n)-th node gives 


Beer а[Е,]3+6(Е,]3=0 , (4.43) 
j 


Qt ) Ox? 


where, from (4,7), 


[E,]5 2 0.5a4x2 (- z) (2) + O(Ax*) , 


дх* 


n 1\ (o*T 
[Ey] о 2-2) 1) «on . 


If Ax, = Ax,/2 the leading term in the composite truncation error can be eliminated 
if 
a+0.25b=0 . 


But from (4.42) it is clear that a+b=1. Therefore 


and the composite solution 7, 2(4/3) T, —(1/3) T, will be expected to have а 
solution error of O(Ax*) on a sufficiently refined grid. Some typical rms errors are 
shown in Table 4.3 with s — 0.5. The error is reducing like O(A x^) for the finer grids. 
However, although the accuracy level associated with Richardson extrapolation is 
superior on a fine grid, the reverse is true on a coarse grid. 

The FTCS scheme is unusual in the sense that a fourth-order scheme can be 
obtained by the special choice 5= 1/6 without the need for Richardson extrapol- 
ation. This is not the case for other schemes such as the fully implicit scheme (7.19) 
and the Crank-Nicolson scheme (7.22). But both of these schemes can be made 
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Table 4.3. rms errors for Richardson extrapolation (RE) 


Scheme Ax 0.1 Ax —0.05 Ах 0,025 
FTCS, s=0.5 0.4915 0.0869 0.0153 
ЕТС, s=0.5, RE 0.9744 0.0104 0.634 x 10-3 
FTCS, s=1/6 0.0017 0.542 x 107 * 0.339 x 1075 


fourth-order accurate by applying Richardson extrapolation. If 4x,— 4x,/2 the 
appropriate composite solution is 


1,-31,-41,, (4.44) 


as above. 

In the general case the choice of the coefficients a and b is determined by the 
power p of the leading Ах term in the expression for the truncation error (after 
converting At terms into equivalent 4x terms) and the ratio of the grid sizes, Ax, 


and Ax,. Thus 
( Ах, y 
A 
l = LL. (4.45) 


Implementing Richardson extrapolation for an unsteady problem like the 
diffusion equation, it is possible to treat the solutions Т, and Т, as independent 
solutions obtained on different grids at the final time at which T, is to be computed. 
Since Richardson extrapolation does not affect the evolving solution, but only the 
final solution, Dahlquist and Bjorck (1974) refer to this as passive Richardson 
extrapolation. If T, is computed at the end of every time-step and this solution 
forms the initial solution for computing both T, and T, during the next time-step 
one has active Richardson extrapolation (Dahlquist and Bjorck 1974, p. 340). 

It is instructive to compare the various schemes shown in Table 4.3 on the basis 
of execution time (CPT). For an integration over a finite time, CPT can be 
represented as 





k ak 


CPT =—-_-=—_~ 
dxdt 54х3” 
where k is a constant of proportionality. Therefore 


2.25ak бак 
AG c ӨРТЕ Aaa 





CPI. oe 


Ax › CPT,.o.s RET 


Thus it can be seen that the solution corresponding to СРТ, - 0,5, ке for 4x,— 0.1, 
4x, —0.05 is 50% more accurate than the solution corresponding to CPT,., at 
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х= 0.025 but has one-seventh of the execution time, However, the solution corre- 
sponding to CPT,. 1/6 for 4x=0.05 is approximately twelve times as accurate as 
the solution corresponding to CPT,. ; ве for 4x, = 0.025 and has one-third of the 
execution time. Clearly the use of Richardson extrapolation is effective but is not so 
efficient as the special case s=4. However such special cases are rare, particularly 
for equations and algorithms more complicated than the FTCS scheme applied to 
the diffusion equation. 


4.5 Computational Efficiency 


The question of what accuracy can be achieved with a given algorithm is closely 
connected with the computational efficiency of the particular algorithm. The 
computational efficiency may be defined as the accuracy achieved per unit ex- 
ecution time. Thus an algorithm that achieves a modest accuracy on a coarse grid 
with a small execution time may be computationally as efficient as another 
algorithm that achieves a higher accuracy on a refined grid requiring a large 
execution time. 
The computational efficiency (CE) can be defined by 


k 
CE= РТ? 
where є is the error in the approximate solution іп some appropriate norm (e.g. the 
rms norm) and CPT is the execution (CPU) time. The computational efficiency is a 
useful measure with which to determine whether higher-order finite difference 
schemes or higher-order finite elements are worth introducing. 

For most flow problems the use of high-order (say fourth-order or higher) finite 
difference schemes and high-order (say cubic or higher) finite elements are not 
justified, in terms of improved computational efficiency, particularly in three 
dimensions. However, at least second-order finite difference schemes or linear finite 
elements should be used to avoid the introduction of excessive numerical dissi- 
pation and dispersion (Sect. 9.2). 

In developing different computational schemes it is preferable to compare 
computational efficiency rather than accuracy or economy alone. If different 
computational schemes are compared on the same computer it is possible to 
directly measure the execution time. Care must be taken to ensure that the 
computer is being used in a dedicated mode, if the CPU time is not available. Also 
the execution time to read in data and to print out the results should be excluded or 
allowed for. 


4.5.1 Operation Count Estimates 


As an alternative to the direct measurement of the CPU time, an operation count 
estimate can be determined from a computer listing and some knowledge of the 
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relative execution time of basic operations. This can be done at various levels of 
complexity. The following approach is found to give consistent estimates. 

First a timing program is run to determine the approximate execution time 
associated with each type of operation. А typical timing program and a discussion 
of the procedure is given in Appendix A.1. The main purpose is to obtain the 
relative execution time of the various operations, since these can vary widely 
between different computers, operating systems and compilers. Relative execution 
times are shown in Table 4.4 for typical micro, supermicro and mainframe 
computers running FORTRAN. 

The second step is to identify the major classes of operations that account for 
most ofthe overall execution time. For the NEC-APC IV results shown in Table 4.4 
the following categories of operations would be worth isolating: 


(a) floating point operations (FL) 

(b) fixed point operations (FX) 

(c) replacements, i.e. equals signs (R) 

(d) logical statements, i.e. IF, GOTO, etc. (L) 

(e) mathematical library functions, i.e. SIN, EXP, etc. (M). 


To reduce the operation count to a single number it is necessary to assign a 
relative weighting to the above operations, ie. the operation count might be 
measured in equivalent floating point operations. А typical weighting for the NEC- 
APC IV would be (based on Table 4.4) 


1FL,,=20FX=10R=10L=0.1M . 
For lines 48 to 73 of DIFF (Sect. 3.5.2), and excluding WRITE statements, the 


Table 4.4. Relative execution time for basic operations 


Operation Microcomputer Supermicro Mainframe 
(NEC-APC IVF (SUN Sparc St])* (ІВМ-3090) 
add (FL) 1.0 1.0 10 
sub. (FL) 1.0 1.0 0.8 
mult, (FL) 1.1 1.0 0.8 
div. (FL) 1.4 5.8 3.1 
replace (=) 0.1 1.0 0.1 
IF state. 0.1 0.9 0.2 
add (FX) 0.05 0.6 0.2 
sub, (FX) 0.05 0.6 0.2 
mult, (FX) 0.4 0.6 0.6 
div, (FX) 0.5 5.4 3,2 
power 2.7 20 16,0 
SQRT 2.0 29 16,0 
SIN 10.0 29 17.6 
EXP 6.7 33 15.0 





* Microsoft Fortran 77 running under MSDOS (8086/8087 floating point hardware in single precision) 
* Fortran 77, unoptimised compiler running under UNIX 
* FORTV52, unoptimised compiler running under VM/XA2 operating system 


94 4. Theoretical Background 


following operation count is obtained. 
ОРСТ = [3FL + 6ЕХ + 5R+4L + (№, —2)(SFL+ 10ЕХ + 3К)] М, , 


where N, is the number of x-steps and М, is the number of t-steps. If N,—11 the 
operation count per time step is ОРСТ = 56FL,. In obtaining the above 
operation count a subscripted array has been assigned one fixed point operation to 
locate the subscript. 

Operation count estimates are particularly useful in identifying parts of the 
computer program that are expensive and in comparing the likely economy of 
different algorithms prior to coding and testing. Because of the variability between 
different computers and the effort required to obtain precise results, operation 
counts are most effective when used to look for order-of-magnitude differences in 
execution time. In addition the operation count procedure needs to be adjusted 
when used with vector and parallel-processing computers (Gentzsch and Neves 
1988). 


4.6 Closure 


For the equations that govern fluid motion it is not possible to demonstrate 
convergence directly. However, it is usually straightforward to show that the 
discretised form of the equations is consistent (Sect. 4.2). It is usually possible to 
show that a "linearised" version of the governing equations is stable (Sect. 4.3), 
although this may require a computer-based computation. Consequently the Lax 
equivalence theorem tends to provide a necessary rather than sufficient condition 
for convergence. In practice verification of stability usually requires numerical 
evaluation. 

The matrix and von Neumann methods for determining stability are strictly 
only applicable to linear equations, although both methods will give some guid- 
ance if the nonlinearity is frozen locally. An alternative energy method can handle 
some nonlinear equations directly. This method is described by Richtmyer and 
Morton (1967, p. 132). 

It should be stressed that real flow problems are computed on finite grids and 
that theoretical properties, deduced from allowing the grid to converge to zero size, 
may not be realisable on a particular finite grid. This is more likely to be a problem 
with higher-order schemes. 

The use of a Taylor series expansion to establish consistency (Sect. 4.2) is also 
important in providing an explicit expression for the truncation error. Because of 
the close correlation between the truncation error and the solution error, at least 
on a fine grid, anything that can be done to reduce the truncation error is also likely 
to reduce the solution error. This may entail making a specific choice for 
s(2a4t/Ax?) as in the FTCS scheme or it may involve combining solutions so that 
the leading term in the truncation error is eliminated as in Richardson extra- 
polation (Sect. 4.4.1). 
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For steady problems there is another alternative, called the deferred correction 
method (Smith 1965, p. 140). In this method a preliminary solution is computed 
and used to evaluate the leading term in the truncation error. This term is “added” 
to the original discretised equation as a source term and an improved solution 
obtained. In principle the improved solution can be used to reevaluate the leading 
term in the truncation error and the whole process repeated. Usually a single 
improved solution is computationally most efficient. 

Elimination of the leading term in the truncation error can also be used to 
construct higher-order schemes. After the initial discretisation the leading term in 
the truncation error is discretised and “added” to the original scheme. An example 
of this approach is provided in Sect. 9.3.2. 

Practical flow problems often require a computational solution in complicated 
three-dimensional domains, involving thousands of nodal unknowns. For such 
large-scale problems the assessment of the relative computational efficiency of 
competing numerical schemes is an essential prerequisite to the main body of the 
computational investigation. Finlayson (1980) provides work estimates and accu- 
racy comparisons of the finite difference, finite element and orthogonal collocation 
(Sect. 5.1) methods applied to ordinary differential equations, and one- and two- 
dimensional partial differential equations. 


4.7 Problems 


Convergence (Sect. 4.1) 


41 Modify program DIFF to use the following five-point symmetric scheme in 
place of the FTCS scheme. For interior points use 


Т 1 4 4 1 
oa (opina 253a Ta) 49 ; 


and for j 22 use 


T [11 1 1 

352% (52-:-32+03 И J| 47 , 
and the equivalent formula for j2 JMAX — 1. These are the same modifi- 
cations as required for Problem 3.7. 
(a) Obtain solutions with s=0.3, for 4x —0.2, 0.1 and 0.05. 
(b) Compare the reduction in the rms error with 4x with that of the FTCS 
scheme (Table 4.1). 

4.2 Repeat Problem 4.1 with the following scheme in place of the FTCS scheme, 
in program DIFF, 


05T'71—2T4-15TT a(T,-2TH Tha) _ 


0. 
At Ax? 
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For the first time step use the forward time difference formula 
oT _ Т" +1 __ Т" 
ПУТИНЕ 


These аге the same modifications as required for Problem 3.8. 


Consistency (Sect. 4.2) 


4.3 


4.4 


Establish that the scheme introduced in Problem 4.1 is consistent with the 
governing equation 2Т/0-а02Т/0х2--0. Is the order of the truncation error 
in agreement with the convergence rate found in Problem 4.1? Consider both 
interior points and points (j=2, JMAX — 1) adjacent to the boundary. 

Test the consistency of the scheme introduced in Problem 4.2. Is the conver- 
gence rate obtained in Problem 4.2 in agreement with the order of the 
truncation error? 


Stability (Sect. 4.3) 


4,5 


4.6 


47 


4.8 


4.9 


Apply the von Neumann stability analysis to determine the stability limits of 
the scheme introduced in Problem 4.1 for: 

(a) interior points, 

(b) points adjacent to the boundary, /=2 and JMAX—1. 

It may be convenient to evaluate numerically the expression for the amplitude 
ratio |G| as 9 and s are varied. 

Use the von Neumann stability analysis to establish the stability limits for the 
scheme introduced in Problem 4.2. This will require the solution of a 
quadratic equation іп |G]. 

Confirm the stability limits obtained in Problems 4.5 and 4.6 by obtaining 
numerical solutions for "unstable" values of s. 

Examine the stability of a discretisation of 0T/0t —«0? 0х? =0 based on a 
forward time representation for 27/0: and the formula obtained in Problem 
3.1 for 02Т/дх?. 

Apply the matrix method to the following discrete system, 


М.(АТ? !J At) -a[BL,,T?* ! £(1-)L,,T7]20, 
where 
M, AT, = 64T,., (1720) AT; 6 AT;, 1. 


This scheme is the same as Eq. (7.31) with y — 0. Obtain the following stability 
restriction on s and discuss the implications: 


s € (0.5 — 20)/(1 — 2B) . 


Solution Accuracy (Sect. 4.4) 


4.10 For the scheme considered in Problem 4.1 develop a Richardson extrapolated 


algorithm that will eliminate the leading term in the truncation error. Deter- 
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mine numerically whether this scheme is more accurate and if the theoretical 
convergence rate can be achieved. 

4.11 For the scheme considered in Problem 4.2 obtain a Richardson extrapolated 
algorithm and compare numerical solutions with those obtained in Problems 
4.1 and 4.2. 


Computational Efficiency (Sect. 4.5) 


4.12 For the scheme considered in Problem 4.10 determine the operation count 
and compare with that of the FTCS scheme when both are achieving the same 
accuracy. 

4.13 For the scheme considered in Problem 4.11 determine the operation count 
and compare with: 

(a) FTCS scheme when both are achieving the same accuracy, 
(b) the Richardson-extrapolated FTCS scheme. 

4.14 Discuss which is the “best” scheme of those considered in this chapter, taking 
into account computational efficiency, stability restrictions and ease of 
coding. 


5. Weighted Residual Methods 


Weighted residual methods (WRMs) are conceptually different from the finite 
difference method in that a WRM assumes that the solution can be represented 
analytically. For example, to obtain the solution of the diffusion equation (3.1) the 
following approximate solution would be assumed: 


J 


Т= У a(d) , (5.1) 


j=l 


where a,(t) are unknown coefficients and ф(х) are known analytic functions. The 
terms ф(х) are often referred to as trial functions and (5.1) as the trial solution. By 
forcing the analytic behaviour to follow (5.1) some error is introduced unless J is 
made arbitrarily large. It may be recalled that the finite difference method defines a 
solution at nodal points only. 

In Sect. 5.1 various weighted residual methods are indicated. In Sect. 5.1.1 some 
of these WRMs are applied to a simple ordinary differential equation to provide a 
tangible basis for comparison. The finite volume method, described in Sect. 5.2, is 
closely related to the subdomain method (Sect. 5.1). However in implementation, 
finite volume discretisation resembles a finite difference method. In Sect. 5.2.2 the 
particular difficulties of applying the finite volume method, when second de- 
rivatives are present, are indicated and a computer program, FIVOL is developed 
(Sect. 5.2.3) to solve Laplace's equation in an irregular domain. 

In this chapter the finite element and spectral methods will be developed from 
the Galerkin method which is a member of the class of methods of weighted 
residuals. The description of the finite element and spectral methods will be 
provided primarily through the medium of worked examples, which will emphasise 
the mechanics of applying the methods. The examples will also show, at least for 
the finite element method, a connection with some of the finite difference methods 
to be considered in Chaps. 7-10. Both the finite element and spectral methods are 
discussed at greater length by Fletcher (1984), and the spectral method by Canuto et 
al. (1987). 

In Sects. 5.3-5.5 the finite element method is considered. First the increased 
accuracy of linear and quadratic interpolation in one and two dimensions with grid 
refinement is demonstrated. Specific examples implementing the finite element 
method are provided in Sects.5.4 and 5.5. This includes computer programs, 
STURM Юг the Sturm-Liouville problem (Sect. 5.4.2) and DUCT for viscous flow 
in a rectangular duct (Sect. 5.5.2). The ability to handle distorted computational 
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domains is a feature of the finite element method; the use of isoparametric elements 
for this purpose is discussed in Sect. 5.5.3. 

The spectral method applied to the diffusion equation with straightforward 
Dirichlet boundary conditions is developed in Sect. 5.6.1. In Sect. 5.6.2 the difficult- 
ies of applying a spectral method with Neumann boundary conditions are 
discussed and the more general formulation indicated. The closely-related pseudo- 
spectral method is described in Sect. 5.6.3. 


5.1 General Formulation 


Many computational methods can be interpreted as methods of weighted residuals 
(WRMs). A comprehensive description of WRMs and applications up to 1972 is 
given by Finlayson (1972). Here WRMs will be developed sufficiently to demon- 
strate the connection between the finite element, spectral and finite volume 
methods. 

The starting point fora WRM is to assume an approximate solution. Equation 
(5.1) can be extended slightly by writing 


T (x, у, z, -Т.(х, у, 2,0+ У аф, у, 2), (5.2) 
}=1 


where То (х, у, z, t) is chosen to satisfy the boundary and initial conditions, exactly if 
possible. The approximating (trial) functions ф,(х, у, z) are known. In one spatial 
dimension the approximating functions might be polynomials or trigonometric 
functions, e.g. 


$j(x)2x^! or ф(х) = іп јлх . 


The coefficients a,(t) are unknown and are to be determined by solving a system of 
equations generated from the governing equation. For time-dependent problems a 
system of ordinary differential equations (in time) will be solved for а; (г); for steady 
problems a system of algebraic equations will be solved. 

It is assumed that the governing equation, for example the heat conduction 
equation, can be written 

_ oT æT 
L(T)=—-a— =0, 5.3 
(7) a Ax? 05) 

where the overbar denotes the exact solution. If the approximate solution, (5.2), is 
substituted into (5.3) it will not be identically zero. Thus we can write 


L(T)-R , (5.4) 


where R is referred to as the equation residual. R is a continuous function of x, y, z 
and t in the general case. If J is made sufficiently large then, in principle, the 
coefficients a(t) сап be chosen so that R is small over the computational domain. 
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The coefficients a,(t) are determined by requiring that the integral of the 
weighted residual over the computational domain is zero, i.e. 


[ff Wal, у, 2) Rdxdydz=0 . (5.5) 


By letting m=1,..., M a system of equations for the aps is generated. For the 
unsteady case considered here, this will be a system of ordinary differential 
equations. For the steady case a system of algebraic equations is generated. 

It may be noted that (5.5) is closely related to the weak form of the governing 
equation, i.e. 


ІШ W,,(x, у, 2) L(T) dxdydz=0 . (5.6) 


which allows discontinuities in the exact solution to appear (Lax and Wendroff 
1960). 

Different choices for the weight (test) function W, in (5.5) give rise to different 
methods in the class of methods of weighted residuals. Some of these methods are: 


(i) Subdomain Method. The computational domain is split up into M subdomains 
Dm, Which may overlap, and 
W,= 1 in Dp 
(5.7) 
=0 outside р, . 


The subdomain method coincides with the finite volume method (Sect. 5.2) in its 
evaluation of (5.5). Equations (5.5 and 7) provide an appropriate framework for 
enforcing conservation at the discretised equation level. In this way the conser- 
vation properties inherent in the governing equations are preserved. This is a 
particular advantage in obtaining accurate solutions for internal flows or flows 
with shock waves. 


(ii) Collocation Method 
W,(x)-ó(x — x,) , (5.8) 


where ó is the Dirac delta function and x —(x, y, z). The substitution of (5.8) into 
(5.5) indicates that the collocation method is obtained by setting R(x,,) 2 0. Thus 
finite difference methods are typically collocation methods without the use of an 
approximate solution. 

The orthogonal collocation method (Finlayson 1980) is a particularly effective 
interpretation of this method. In the orthogonal collocation method, the approxi- 
mating functions in (5.2) are chosen from orthogonal polynomials and the un- 
knowns in (5.2) are the nodal values (of Т). The collocation points х, are chosen 
from the roots of one of the approximating function orthogonal polynomials. 


(iii) Least-squares Method 
OR 


Ж, = a. (5.9) 
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Equation (5.9) is equivalent to the requirement that 
ШІ R? dx dy dz is а minimum . 
(iv) Galerkin Method 
„(х y, 2)=Фи(х, У, 2) , (5.10) 


ie. the weight functions are chosen from the same family as the approximating 
(trial) functions. If the approximating functions form a complete set (for poly- 
nomials a complete set would be 1, x?, x3, . . . , xM), (5.5) indicates that the residual 
is orthogonal to every member of a complete set. Consequently as M tends to 
infinity the approximate solution, Т, will converge to the exact solution 7. 


5.1.1 Application to an Ordinary Differential Equation 


A comparison of the various methods of weighted residuals is made by obtaining 
the solution to the ordinary differential equation 

di 

=~ 5=0 for 0<х<1, | (5.11) 
with boundary condition y=1 at x=0. This problem has the exact solution 
у=ехрх. 

A suitable approximate solution, equivalent to (5.2), is 


N 
у=1+ У axi . (5.12) 
]=1 


This approximate solution automatically satisfies the boundary condition. In 
contrast to the situation for (5.2), the coefficients а; in (5.12) are constants. 
Substitution of (5.12) into (5.11) produces the equation residual 


N 
К=-1+ У а()х 1-х). (5.13) 
рға 


The coefficients а; are determined by setting the integral of the weighted residual 
over the computational domain equal to zero. Thus, following (5.5), 


1 
f W(x) R(x) dx=0 . (5.14) 
0 


The different methods of weighted residuals are produced by different choices 
for the weights W,,(x). For example, the Galerkin method uses И, (х) = x" ^! for 
т= 1, М. Evaluation of (5.14) for each value of m produces a system of equations 
that can be written 


SA=D ; (5.15) 
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A is the vector of unknown coefficients а). For the Galerkin method an element of S 
is given by 


j 1 . 
= - 5.16 
5л) (аа ғ) 5-16) 


and an element of D by d,,=1/m. For the choice N =3, (5.15) becomes 





1/2 23 3/4 a, 1 
1/6 5/12 11/20 a, |=| 12|, (5.17) 
1/12 3/10 13/30 || а, 1/3 


which has the solution 
a,=1.0141 , a,=0.4225 , a,=0.2817 . (5.18) 
Substitution of (5.18) into the approximate solution (5.12) produces the result 
y=141.0141x +0.4225х? +0.2817х3 . (5.19) 


This solution is compared with the exact solution in Table 5.1. 


Table 5.1. Galerkin solutions of dy/dx — у=0 


Approximate solution Exact 
es solution, 
x Linear (№ = 1) Quadratic (М=2) Cubic (N=3) ў=ехр(х) 
0. 1.0 1.0 1.0 1.0 
0.2 1.4 1.2057 1.2220 1.2214 
0.4 1.8 1.4800 1.4913 1.4918 
0.6 2.2 1.8229 1.8214 1.8221 
0.8 2.6 2.2349 2.2259 2.2251 
10 3.0 2.7143 2.7183 2.7183 
Solution ` 
err. (rms) 0.2857 0.00886 0.00046 — 
Ros 0.5271 0.0583 0.00486 — 


The solution error for different values of N is plotted in Fig. 5.1. It is clear that 
the solution error reduces rapidly as N is increased. It can be seen from Table 5.1 
that both the rms solution error and the rms equation residual R,,,,, decrease rapidly 
as N increases. For the general problem the exact solution is not available and 
consequently the solution error cannot be computed. However the equation 
residual can be evaluated and this gives a qualitative indication of the closeness to 
the exact solution. 
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Fig. 5.1. Error distribution for Galerkin sol- 
ution of dy/dx—y=0 





Some of the other methods of weighted residuals discussed in Sect.5.1 are 
applied to (5.11). For N =3 the least-squares method produces the following system 
of equations in place of (5.17): 


1/3 14 1/5 a, 1/2 
1/4 8/5 2/3 a, |=| 2/3 |. (5.20) 
Us 2/3 33/35 || a, 3/4 - 


The subdomain method with intervals, ©<х< 1/3, 113<х<2/3 and 2/3<х<1.0 
produces the following system of equations: 


5/18 8/81 11/34] Га, 1/3 
3/18 20/81 69/324 | | a, |=| 1/3 |. (5.21) 
1/18 26/81 163/324 | | а, 1/3 


A collocation method with the residual evaluated at х=0, 0.5 and 1.0 gives the 
following system of equations: 


100 0. a, 1.0 
0.5 0.75 0.625 а, |=| 10 |. (5.22) 
0 10 20 а; 1.0 


The solutions for a,, a, and a, corresponding to the various methods аге shown in 
Table 5.2 and the corresponding approximate solutions (5.12) in Table 5.3. 

The optimal rms solution is obtained by minimising the rms error with three 
unknown coefficients. It is more appropriate to compare the various weighted 
residual solutions with this rather than the exact solution. Clearly the Galerkin, 
least squares and subdomain methods are producing solutions close to optimal. 
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Table 5.2. Comparison of coefficients for approximate solutions of 
dy/dx—y=0 


Coefficient 





Galerkin 1.0141 0.4225 0.2817 
Least squares 1.0131 0.4255 0.2797 
Subdomain 1.0156 0.4219 0.2813 
Collocation 1.0000 0.4286 0.2857 
Optimal rms 1.0138 0.4264 0.2781 
Taylor series 1.0000 0.5000 0.1667 


Table 5.3. Comparison of approximate solutions of dy/dx — у=0 





Least Sub- Collo- Optimal Taylor 

x Galerkin squares domain cation rms series Exact 

0. 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 
0.2 1.2220 1.2219 1.2213 1.2194 1.2220 1.2213 1.2214 
0.4 1.4913 1.4912 1.4917 1.4869 1.4915 1.4907 1.4918 
0.6 1,8214 1.8214 1,8220 1.8160 1.8219 1.8160 1.8221 
0.8 2.2259 2.2260 2.2265 2.2206 2.2263 2.2053 2.2255 
1,0 27183 2,7183 2.7187 2.7143 2.7183 2.6667 2.7183 


Solution 0.000458 0.000474 0.000576 0.004188 0.000434 0.022766 — 
error 
(rms) 


The accuracy of the collocation method is sensitive to the choice of the sample 
points. Evaluating the residual at x 20.1127, 0.50 and 0.8873 produces a solution 
identical with the Galerkin method. 

The solutions given in Table 5.3 are plotted in Fig. 5.2, and it is clear that 
weighted residual methods are effective (with the possible exception of the collo- 
cation method) in minimising the solution error over the whole domain. In 
contrast, the Taylor series expansion, which matches the exact solution at х= 0.0 
produces a large solution error away from х= 0.0. 

The various methods of weighted residuals are compared at greater length by 
Fletcher (1984, pp. 32-39) who makes the following observation: "the Galerkin 
method produces results of consistently high accuracy and has a breadth of 
application as wide as any method of weighted residuals". In the present situation 
the importance ofthe Galerkin method is that it leads directly to the Galerkin finite 
element method and the (Galerkin) spectral method. 
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Fig. 5.2. Error distribution for weighted 
residual solutions (М = 3) of dy/dx—y=0 





5.2 Finite Volume Method 


This method is similar to the subdomain method, except that there is no explicit 
introduction of an approximate solution like (5.2). The method is particularly 
concise if only first derivatives are present (Sect. 5.2.1). If second derivatives occur 
(Sect. 5.2.2) then additional manipulation is required. 


5.2.1 Equations with First Derivatives Only 


Here the finite volume method will be illustrated for the general first-order 
equation 
24 OF 20 1. (5.23) 
ôt Ox ду 


which by appropriate choice of 4, F and G represents the various equations of 
motion. For example if 4-0, F— i and G— Q? (5.23) is the two-dimensional 
version of the continuity equation (11.10). 
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Applying the subdomain method to (5.23) for the finite volume ABCD shown in 
Fig. 5.3 gives 
дд OF 26 


1| —+—+— |4хау-0, 5.24 
d, ЕЁ дх 5) У ( ) 


or applying Green’s theorem, 
аг. 
{440+ | H-nds=0 , (5.25) 
dt ABCD 


where H —(F, С). In Cartesian coordinates, 
H-nds=F dy—Gadx . (5.26) 


Equation (5.25) is just a statement of conservation. For the particular choice й = @, 
F =й, G— Qv, (5.25) coincides with an integral statement of the conservation of 
mass. Consequently the finite volume method is a discretisation of the governing 
equation in integral form (Sect. 11.2), in contrast to the finite difference method, 
which is usually applied to the governing equation in differential form. 

An approximate evaluation of (5.25) would be 


d DA 
PC d) 9, (F4y—GAx)=0 , (5.27) 
AB 


where sæ is the area of the quadrilateral, ABCD in Fig. 5.3, and the average value of 
q over the quadrilateral is associated with q; к. In (5.27), 


Ayag—Ypg— YA, 4X4g=Xg—X, and 


Ға-05(Е;,-1-5Е;,) > G4570.5(G;,-1- ©) , 





Fig. 5.3. Two-dimensional finite volume 
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with similar expressions for Дувс, etc. If æ is not a function of time, (5.27) becomes 


% dq, «ldt +0.5(F;x -1 t+ Fj ,)Ay 4g —0.5(G, x-1 +С) Ax ag 
+0.5 (Fig + Е, + 1,6) AV ac — 0.5(G, + Gi 1,4) 4хвс 
+0.5 (Е; ++ 1)4Уср -0.5(6; + Gi. Lei) Ахср 
+0.5(Ё;-1к+Р;к)Аурл—0.5((;-, 4c G;4)4xp4—0 . (5.28) 


J J 


If the global grid (j, К) is irregular the finite volume equation (5.28) provides а 
discretisation in Cartesian coordinates without the need to introduce generalised 
coordinates (Chap. 12). If the global grid is uniform and coincides with lines of 
constant x and y, (5.28) becomes 


d 
Ax4y = dix —0.5(G;,-, +G; )dx +0.5(F + F5, 1,к)АУу 


+0.5(6;.+6;,+1)4х—0.5(Е,_ iat Fj.) 4y=0 » 
or 


(5.29) 


which coincides with a centred difference representation for the spatial terms in 
(5.23). 

The finite volume method, which has been used for both incompressible and 
compressible flow, has two major advantages. First it has good conservation (of 
mass, etc.) properties. Second it allows complicated computational domains to be 
discretised in a simpler way than either the isoparametric finite element formulation 
(Sect. 5.5.3) or generalised coordinates (Sect. 12.2). 


5.2.2 Equations with Second Derivatives 


In Sect. 5.2.1 the finite volume method is applied to (5.23), in which only first 
derivatives appear, and produces a relatively straightforward discretisation for- 
mula (5.28). If second derivatives are present in the governing equation the finite 
volume method requires some modification. 

This situation is illustrated here by seeking solutions of Laplace’s equation 


ao 0$ 
рут , (5.30) 
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Fig. 5.4. Computational domain for the solution of Laplace’s 
equation 


in the computational domain shown in Fig. 5.4 with the following Dirichlet 
boundary conditions: 


on WX, ф-0 


sin 0 


оп XY, ф= (5.31) 





Гхү 


- 1 
on YZ, ф--- 
ү2 


on ZW. gone | 


Гу? 





With this choice of boundary conditions, (5.30) has the exact solution 


sin 0 





ф- , (5.32) 
7 
which permits the accuracy of the computational solutions to be assessed directly. 
If Laplace’s equation (5.30) had been written in polar coordinates the compu- 
tational domain shown in Fig. 5.4 would be regular. However, by deliberately 
solving this problem in Cartesian coordinates the ability of the finite volume 
method to handle distorted domains is illustrated, while retaining the advantage of 
an easily computed exact solution. Program FIVOL (Sect. 5.2.3) is sufficiently 
general to be applicable to other domain shapes once the interior grid points have 
been determined, e.g., using the techniques discussed іп Sects. 13.2-4. 
The finite volume method proceeds by applying the subdomain method to 
(5.30) for the finite volume ABCD shown in Fig. 5.5, giving 


27 21 
f (2% аку f A-nds=0 , (5.33) 


ABCD ду ABCD 
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Fig. 5.5. Finite volume for a distorted 
grid 





where 


дф дф 
H-n ds =~ dy — ду 4%. 


Following the same steps as in Sect. 5.2.1, (5.33) is evaluated approximately by 


29 | Ia | 
-- Ау -| — Ах 
Ё һк-1/2 АВ ау 1%-1/2 “ 
E | 
+ = AYsc-| = Ax 
Ё f+ 1/2,k BC ду 1+ 1/2. К 8 
дф дф 
+ | — Ду, P4 Ах 
Е |. а 0 LOY ка 7 


ae laaa La 
+| — AYpa— | — AXpa=0. (5.34) 
Е j-1/2,k pa ду |j-1/2,k pA 


Different techniques for evaluating [дф/дх], к- 1,2, etc. are discussed by Peyret 
and Taylor (1983, p. 111). Here [09/0x]; ,- 1/2 is evaluated as the mean value over 
the area B'BC'D'AA'B' in Fig. 5.5. Thus 


дф оф 
ôx - 4 dy , 5.35 
FIN Cie 2) xdy= (5 )ie y (5.35) 
D: " z -[— d 36 
E jk—1/2 (ы ) ЩЕ = (62) \¢ х, (5.36) 


апа 














f фаду фк -1 Аулв + ÓnAYgc + Фу к{Ус р +ФАДУрА 
ACD 
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and similarly for 


| фах. 
A'B'C'D' 
If the mesh is not too distorted, 
Аулнж-Ауср Аулв апа Aygc = —4Ayp,-2 Ay, ү, and 
San T Sareo = AXaAgÁYyk- 1,7 АУлв4Х%-і. (5.37) 


Therefore (5.35) becomes 


BH _АУлв(Фьк-1—Фь) + Ау 1a(95 704). a 
Ox |к-1/2 Sap (5.38) 
BH _ ГАхав(фук- Ф) E (Oa 04) 

ду |һк-12 | Sas 


Developing equivalent expressions for [66/@x]j+1,2,4, etc. permits (5.34) to be 
written as 


Охв(Фук-1— Фк) + Рав(фв- Pa) + Овс(Фу+1,к— Фк) + Рес(Фс- Фв) 
+Оср(Ф к «1— Фк) + Pep(Óp — Pc) + Орл(Ф;-1,к-Фк)+ PpA(04— Фр), 
-0, (5.39) 
where 
Олв=(Ахів + 4у4в)/8ав › Рав=(АхдьАхь-1,к+ Дудв АУ, -1,к)/Зав 
Овс= (Ахвс + 4уўс)/5вс › Рас =(АхвсАх +1, + AYnc ДУу+1, вс 
Ось=(4хёь+4уёь)/8сь >» — Pep (AxcpAxye it АусьАУк+1,)/$сь and 
Qna =(dxba+4Yba)/ Soa» Рьа=(Ахьд4Аху-1,3 + AYpAAYj-1,) Spa - 


In (5.39) фл, ха, ул are evaluated as the average of the four surrounding nodal 
values. Thus 


$47025(Ó; kt 6j ik t Ój-ia-1t Фук- 1). 


Substitution into (5.39) generates the following nine-point discretisation of (5.30): 


0.25(Рср- Рра)ф;–1,к+1 [Qcp +0.25(Рьс — Poa) Өкі 
+0.25(Рьс — Pcp)ó je 1,4 «1 [Орл-+0.25(Рер — Раһ)19;-1. 
— (Qus + Qsc * Qcp tpa); kt [Qsc +0.25(Рав- Роь)1ф;+1,к 
T 0.25(Pp4 — Рав)ф;-1.к-1 +[Очв+0.25(Рьл- Рас) Фк-1 
+0.25(Рав- Рьс)Фу+1.к-1=0. (5.40) 
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In (5.40) Q 43 and Рав, etc. can be evaluated once and for all when the grid points 
are determined. 

Equation (5.40) is solved conveniently using successive over-relaxation (SOR), 
Sect. 6.3. Equation (5.40) is manipulated to give an estimate of $71!, thus 


Фі-1025(Рср- Pp4)6;- 1.1 F [Qcp +0.25(Рьс — Ppa) Фль+ 1 
+0.25 (Pac — Pep) Pj+ 1.1 +[Орл +0.25(Рсь- Рав) ]Ф)- 1,4 
+[0вс +0.25(Рав— Poo) Фуа, 
+0.25(Ppa— Рав)ф;– 1-1 +[Олв+0.25(Рьл— Pac) Фу x-1 


+0.25(Рав – Рвс)Ффу+1,к-1}"/(Охв+ Овс + Qcpo-- QpA) (5.41) 
and the improved solution is 
$51 =P A(0$,— 954) , (5.42) 


where 4 is the relaxation parameter (Sect. 6.3). 

An interesting feature of the finite volume method is that Neumann (derivative) 
boundary conditions can be handled as readily as Dirichlet boundary conditions 
by direct substitution into (5.34). 


5.2.3 FIVOL: Finite Volume Method Applied to Laplace's Equation 


The above method has been incorporated into a computer program, FIVOL, a 
listing of which is given in Fig. 5.6. The main parameters used in FIVOL are 
described in Table 5.4. 

FIVOL reads in and writes out the control parameters (lines 1-25). The grid is 
set up and the initial solution (for SOR) and the exact solution computed (lines 
38-53). Since only the converged solution is of interest, the exact solution is also 
used as the initial solution for SOR. This leads to rapid convergence of the SOR 
scheme. The boundary conditions are set (lines 57-64); strictly this part of the code 
is not necessary since the boundary conditions have been evaluated, implicitly, by 
line 51. 

The grid related parameters (О „в, Рав, etc.) for each side are set (lines 68-123). 
The SOR iterative procedure evaluates (5.41) (lines 135-143), (5.42) at line 146 
computes |ф-—Ф, | в. (line 149) and exits the iteration loop (lines 127-151) if the 
tolerance (EPS) is satisfied. The final solution and exact solution are written out 
(lines 166-167) and |l — ||, computed and written out (lines 171-172). 

Typical output for a 6 x 6 grid 15 shown in Fig. 5.7. By starting from the exact 
solution it is apparent that convergence of the SOR scheme is rapid. The large 
value of | — Ф| is mainly due to the large change in the solution close to ryz. 

The accuracy increases as the grid is refined (Table 5.5) but more SOR 
iterations are required to reach convergence. The discretised equation (5.40) 
reduces to the centred finite difference scheme on a uniform rectangular grid 


ф;- Lk 205. $1.1 NIIT —26,k Фу 20 


1 di (543) 


э 
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1 
2c 

зс FIVOL APPLIES THE FINITE VOLUME METHOD TO THE SOLUTION OF 
4 c LAPLACES EQUATION ІН CARTESIAN COORDINATES ON A POLAR GRID. 
5 С THE DISCRETISED EQUATION IS SOLVED BY SOR 

с 

7 DIMENSION 1(21,21),Ү(21,21),0АВ(21,21),РАВ(21,21),08С(21,21), 
8 1PBC(21, 21) ,QCD(21,21) ,PCD(21,21) ,QDa(21,21) ,PDA(21,21), 
9 2PHI(21,21),PHIX(21,21) 

10 C 

11 OPEN(1,FILE=*FIVOL.DAT’) 

12 OPEN(6,FILE=*FIVOL.OUT’) 

13 READ( 1,1) JMAX, KMAX, NMAX 

14 READ(1,2)RW,RX,RY,RZ,THEB,THEN,EPS OM 

15 1 РОВМАТ(8Т5) 

16 2 FORMAT(8E10.3) 

17 С 

18 УВТТЕ( 6,3) 

19 WRITE(6,4) МАХ ,KMAX , ИМАХ , ЕРЗ , OM 

20 WRITE(6,5)RW,RX,RY,RZ, THES, THEN 

21 З ҒОВМАТ(? LAPLACE EQUATION BY FINITE VOLUME METHOD’ ,//) 

227. 4 FORMAT(’ JMAX-',I2,' KMAX=',I2,’ — WMAX-',I5, 

23 15Х,’ EPS*’,E10.3,*  OMz',F5.3) 

24 5 FORMAT(* — RW-',FB.3,' RX=',FS.3,’ RY=’,F5.3,’ RZ=',F5.3, 
25 15X,' THEBs',F5.1,* TBHEN-',F5.1,//) 

26 C 

27 УМАР = JMAX - 1 

28 KMAP = KMAX - 1 

29 AJM = ЛАР 

30 АКИ = КМАР 

31 DRWX = (RX - RW)/AJM 

32 DRZY = (RY - К2) /АКИ 

33 DTH = (THEN-THES) /AKM 

34 : PI = 3.1415927 

35 C 

38 C SET X, Y, EXACT AND INITIAL PHI 

37 C 

38 DO 7 K = 1,KMAX 

39 ake xK- 1 

40 THK = (THEB + AK*DTH)*PI/180. 

41 CK = COS(THK) 

42 SK = SIN(THK) 

43 DR = DRWX + (DRZY - DRWX)*AK/AKM 

44 RWZ = RW + (RZ - RW)*AK/AKM 

45 pO 6 J = 1,JMAX 

46 AJ=jJ-1 

47 R = RWZ + AJ*DR 

48 X(J,K) = R*CK 

49 Y(J,K) - R*SK 

50 PHIX(J,K) = SK/R 

51 PHI(J,K) = PHIX(J,K) S 

52 6 CONTINUE 

53 7 CONTINUE 

54 C 

55 C SET BOUNDARY VALUES ОЕ PHI 

56 С 

57 DO 8 J = 1,JMAX 

58 PHI(J,1) ғ 9. 

59 PHI(J,KMAX) = PHIX(J,KMAX) 

60 8 CONTINUE 


Fig. 5.6. Listing of program FIVOL 
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61 DO 9 К = 1, КМАХ 
62 РНІ(1,К) = PHIX(1,K) 

63 PHI(JMAX,K) = PHIX(JMAI,K) 

64 9 CONTINUE 

65 С 

66 С SET GRID RELATED PARAMETERS 

67 С 

68 DO 11 K = 2,KMAP 

69 KM =K-1 

70 КР = К+1 

71 DO 10 J = 2, МАР 

72 JM-J-1 

73 JP=jJ+i 

74 ХА = 0.25#(X(J,K) + X(JM,K) + X(JM,KM) + XCJ,KM)) 
75 ҮА = 0.258(¥(J,K) + Ү(ІМ,К) + Y(JM,KM) + YCJ,KM)) 
76 ХВ = 0.25*(X(J,K) + Х(2,КМ) + XCJP,KM) + ІСІР,К)) 
77 УВ = 0.25*(Y(J,K) + YCJ,KM) + Y(JP,KM) + Ү(ЈР,К)) 
78 ХС = 0.25«(Х(2,К) + ХОР,К) + ХОР,КР) + Х(Ј,КР)) 
79 ҮС = 0.2Б5%(Ү(2,К) + УОР,К) + Y(JP,KP) + YCJ,KP)) 
80 XD = 0.25*(Y(J,K) + X(J,KP) + Х(ЈМ,КР) + XCJM,K)) 
81 Үр = 0.25*(YO,K) + YCJ,KP) + Ү(ЈМ,КР) + Ү(ЈМ,К)) 
82 C 

83 C SIDE AB 

84 C 

85 DXA = XB - ХА 

86 DYA = УВ - YA 

87 DIK a X(J,K) - X(J,K-1) 

88 DYK s Y(J,K) - Y(J,K-1) 

89 ЗАВ = ABS(DXA*DYK - DXK*DYA) 

90 QAB(J,K) = (DXA*DXA + DYA*DYA)/SAB 

91 PAB(J,K) = (DXA*DIK + DYA*DYK)/SAB 

92 C 

93 C SIDE BC 

94 C 

95 DXB = XC - XB 

96 DYB = YC - YB 

97 DXJ = І(1,К) - Х(7+1,К) 

98 DYJ = Ү(Ј,К) - Y(J+1,K) 7 
99 SBC = ABS(DYJeDXB - DXJ*DYB) 

100 QBC(J,K) = (DXB«DXB + DYB*DYB)/SBC 

101 PBC(J,K) = (DXB«DXJ + DYBsDYJ)/SBC 

102 C 

103 C SIDE CD 

104 C 

105 DXC - XD - XC 

106 DYC - YD - YC 

107 DXK - X(J,K) - X(J,K*1) 

108 DYK - Ү(1,К) - Y(J,K*1) 

109 SCD = ABS(DXC*DYK - DYCeDIK) 


110 QCD(J,K) = (DXCeDXC + DYC*DYC)/SCD 


111 PCD(J,K) = (DXC*DIK + DYC*DYK)/SCD 
112 С 

113 C SIDE DA 

114 С 

115 DXD = XA - XD 

116 DYD - YA - YD 

117 DXJ = X(J,K) - 1(1-1,К) 


118 DYJ = ¥(J,K) - ¥(J-1,K) 


Fig. 5.6. (cont.) Listing of program FIVOL 
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119 SDA = ABS(DXJ*DYD - DYJ*DXD) 

120 QDACJ,K) = (DXD*DXD + DYD*DYD)/SDA 

121 PDA(J,K) = (DXD*DXJ + DYD*DYJ)/SDA 

122 10 CONTINUE 

123 11 CONTINUE 

124 С 

125 С ITERATE USING SOR 

126 C 

127 DO 14 N - 1,NMAX 

128 SUM - 0. 

129 DO 13 К = 2,KMAP 

130 KM-K-1 

131 КР =K + 1 

132 pO 12 J = 2, МАР 

133 М-1-1 

134 JP=J+1 

135 PED = 0.25*(PCD(J,K)-PDACJ, K)) *PHI(JM,KP) 

136 PAD = PHD + (QCD(J,K) + 0.25*(PBC(J,K)- PDACJ, K)) D *PHICJ, KP) 
137 PHD = PHD + 0.25*(PBC(J,K) -PCD(J, К) ) ePHI(JP,KP) 

138 PED = PED + (QDA(J,K) + 0.25€«(PCD(J,K)-PAB(J,K)) )*PHI(JM,K) 
139 PHD = PHD + (QBC(J,K) + 0.25*(PAB(J, K)- PCD(J, К)))+РНІ(ЈР,К) 
140 PHD = PED + 0.25*(PDA(J,K) - PAB(J,K))*PHI(JM,KM) 

141 PHD = PED + (QAB(J,K) + O.25*(PDA(J,K) -PBC(J,K)) ) *PHICJ, KM) 
142 PHD = PED + Q.25*(PAB(J,K) - PBC(J,K))*PHI(JP,KM) 

143 PHD = PHD/(QAB(J,K)*QBC(J, K)*QCD(J, KO *QDA(J, K)) 

144 DIF = PHD - PHI(J,K) 

145 SUM = SUM + DIF*DIF 

146 PHI(J,K) = PHI(J,K) + OM*DIF 

147 12 CONTINUE 

148 13 CONTINUE 

149 RMS = SQRT(SUM/(AJM-1, )/ AKM-1.)) 

150 IF(RMS .LT. EPS)GOTO 16 

151 14 CONTINUE 

152 WRITE(6,15)NMAX,RMS 

153 15 FORMAT(' CONVERGENCE NOT ACHIEVED IN’,I5,’ STEPS',b5X,^ RMS=', 
154 1E12.5) 

155 С 

156 С COMPARE SOLUTION WITH EXACT 

157 С 

158 16 SUM = 0, 

159 DO 21 K = 1,KMAX 

160 WRITE(6,17)K 

161 17 FORMAT(/,' K=*,I2) 

162 DO 18 J = 1, МАХ 

163 DIF = PHI(J,K) - PHIX(J,K) 

164 SUM = SUM + DIF*DIF 

165 18 CONTINUE 

166 WRITE(6,19)(PHI(J,K), J=1, МАХ) 

167 WRITE(6, 20) (PHIX(J,X),J=1, МАХ) 

168 19 FORMAT(' PHI=’,10F7.4) А 

169 20 FORMAT(' PHX=’,10F7.4) 

170 21 CONTINUE 

171 RMS = SQRT(SUM/(AJM-1, )/ (AKM-1.)) 

172 WRITE(6,22)N,RMS 

173 22 FORMAT(/,' CONVERGED AFTER ^,I3,' STEPS',4X,' RMS-',E12.5) 
174 STOP 

175 END 


Fig. 5.6. (cont) Listing of program FIVOL 
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Table 5.4. Parameters used in program FIVOL 


Parameter Description 

JMAX Number of points in the radial direction 
KMAX Number of points in the circumferential direction 
NMAX Maximum number of iterations 

RW ry (Fig. 5.4) 

RX ғұ (Fig. 5.4) 

КУ ry (Fig. 5.4) 

RZ rz (Fig. 5.4) 

THEB 8, X (Fig. 5.4) 

THEN 07, (Fig. 5.4) 

EPS Tolerance for convergence of SOR 

OM Relaxation parameter 4 in (5.42) 

PHI ф 

РНІХ ф 

PHD ф* 

ХА, УА X4. JA 

DXA Ахав 

DXK Ах -i.k 

RMS 19%- plema ПФ" Ф 





LAPLACE EQUATION BY FINITE VOLUME METHOD 


JMAXe 6 KMAX= 6 ИМАХ= 50 EPS= .100E-04 0М=1.500 
RW= .100 АХв1.000) КҮ=1.000 RZ= .100 THEB= .0 THEN» 90.0 
К= 1 


PHI= ‚0000 .0000 .0000 .0000 .0000 .0000 
PHX= .0000 .0000 .0000 .0000 .0000 .0000 


К= 2 
PHI= 3.0902 1.2439 .7445 .5205 .1929 .3090 
PHI» 3.0902 1.1036 .6718 .4828 .3768 .3090 


К= 3 
РИТ= 5.8779 2.3473 1.4036 .9828 .7441 .5878 
PHX= 5.8779 2.0992 1.2778 .9184 .7168 .5878 


К= 4 
РНІш 8.0902 3.1786 1.8990 1.3345 1.0163 .8090 
PHX= 8.0902 2.8893 1.7587 1.2641 .9866 .8090 


К= 5 
PHI- 9.5106 3.6209 2.1680 1.5349 1.1802 .9511 
РЫХ= 9.5106 3.3966 2.0675 1.4860 1.1598 .9511 


К= 6 
РНІв10.0000 3.5714 2.1739 1.5625 1.2195 1.0000 
РНХ=10.0000 3.5714 2.1739 1.5625 1.2195 1.0000 


CONVERGED AFTER 15 STEPS RMS= .13259E+00 Fig. 5.7. Typical output from FIVOL 
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Table 5.5. Finite volume solution errors with grid refinement 
(rw =rz=0.1, Гу Тұ = 1.0, 0, x = 0, 0,у = 90, А = 1.5) 


No of iterations 


Grid le — liens to convergence 
6x 6 0.1326 15 

Пхи 0.0471 19 

21х21 0.0138 51 


which has second-order convergence, іе. halving the grid (if fine enough) reduces 
the error by a factor of 4 (27). For the non-uniform grid results shown in Table 5.5 
the convergence rate is less than second order. Increasing grid distortion is 
expected to reduce the convergence rate further (see Problem 5.4). 


The finite volume method is widely used for transonic inviscid flow (Sect. 14.3) and 
for viscous flow (Sect. 17.2.3). 


5,3 Finite Element Method and Interpolation 


The finite element method was developed initially as an adhoc engineering pro- 
cedure for constructing matrix solutions to stress and displacement calculations in 
structural analysis. The method was placed on a sound mathematical foundation 
by considering the potential energy of the system and giving the finite element 
method a variational interpretation. 

However, very few fluid dynamic (or heat transfer) problems can be expressed in 
a variational form. But for many situations, the Galerkin method is equivalent to 
the Ritz method for solving variational problems. Consequently most of the finite 
element applications in fluid dynamics have used the Galerkin finite element 
formulation. Here we will focus exclusively on the Galerkin formulation of the 
finite element method. 

A traditional engineering interpretation of the finite element method is given by 
Zienkiewicz (1977). A more mathematical perspective of the method is provided by 
Strang and Fix (1973), Oden and Reddy (1976) and Mitchell and Wait (1977). 
Computational techniques for mainly structural applications of the finite element 
method can be found in Bathe and Wilson (1976). The application of the traditional 
finite element method to fluid mechanics is treated by Thomasset (1981) and Baker 
(1983). 

In comparison with the (traditional) Galerkin method defined in Sect. 5.1, the 
Galerkin finite element method has two particularly important features. Firstly the 
approximate solution (5.2) is written directly in terms of the nodal unknowns, i.e. 


T= У To% y, 2). (5.44) 


ізі 
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Equation (5.44) can be interpreted as an interpolation of the local nodal point 
solution T;. By working directly with the nodal unknowns one level of compu- 
tation is saved at the equation-solving stage (Chap. 6) and the nodal solution has 
direct physical significance. Also the similarity with the finite difference method is 
easier to see and the interpretation of the finite element method as a means of 
discretising (Sect. 3.1) the (continuous) governing partial differential equation be- 
comes more obvious. 

The approximating functions, ф(х, y, z) in (5.44), are often called trial or 
interpolation functions in the mathematical literature and shape functions (with the 
symbol М; replacing ¢,) in the engineering literature. The second important feature 
is that the approximating functions ¢,(x, у, 2) are chosen, almost exclusively, from 
low-order piecewise polynomials restricted to contiguous elements. This produces 
relatively few non-zero terms which, with appropriate ordering (Jennings 1977, 
Chap. 5), can be located close to the main diagonal of the matrix of equations. This 
is important at the equation-solving stage (Chap. 6) since it implies that the 
solution of the matrix of equations will be more economical. 

The finite element method achieves discretisation in two stages, both of which 
introduce errors (but not always additive errors). First a piecewise interpolation is 
introduced over discrete or finite elements to connect the local solution to the 
nodal values. This aspect wil] be examined in this section. The second stage uses a 
weighted residual construction (Sect. 5.1) to obtain algebraic equations connecting 
the solution nodal values. The mechanics of implementing the second stage and the 
solution behaviour are illustrated in developing programs STURM (Sect. 5.4.2) 
and DUCT (Sect. 5.5.2). 


5.3.1 Linear Interpolation 


Linear one-dimensional approximating functions, ф(х), are shown in Fig. 5.8. The 
function ф;(х) takes the values 


element А element В 
Ф; Фа СС! ф. =] 


(%-%-1) } (kjaki) 





*j-2 МЕ xj хы Xi,2 


Fig. 5.8. One-dimensional linear approximating functions 


118 5. Weighted Residual Methods 


фу- for х<х;-1. 

фу- Х- Х)-і for Xj-1 Sx Sx, (element А), (5.45) 
Xj; — Xj-1 

фу-1 Юг х-- ху, 
хах Г <х< | В 5.46 

ф = or x; $ x S xj, (element В), (5.46) 
Хужа X 

фу = 0 ігх>х;1. 


Thus ф; is non-zero only for the range xj, < x < xj,,. Ш a similar manner the 
approximating function $;,, is non-zero only between nodes j and j 2. 
For linear approximating functions, (5.44) has the form 


T= T, i ф;-1 + T; ój in element A А 


(5.47) 
Т = T; 0; + T;.i ізі іп element В 3 


ie. only two terms contribute in each element due to the local nature of the 
approximating functions (Fig. 5.8). In element A, ф; is given by (5.45) and ф)- by 


XX; 


¢;-1.=——* . _ (5.48) 
Xj-1 ~ Xj 
In element B, ф, is given by (5.46) and $4, by 
х-х, 
фу+1 -———— . (5.49) 
Хужа 7 Xj 


The above construction will be used to obtain a linear interpolation of the 
function 


у = 1 + соѕ(0.5лх) + sin(zx) (5.50) 


in the range 0 < x < 1. Equation (5.50) is shown іп Fig. 5.9. Тһе nodal values у; are 
obtained from the exact solution у; = y(x;). In a particular element the local 
interpolated solution y,, is given by 


Yin = Yj-1 9j-1(x) + y;%(x) іп element A and 


| (5.51) 
Yin = Yj ф(х) ty; 1 фу). 1(х) іп element B А 


Іп element A, ф;- ; and ф; are given by (5.48) and (5.45), respectively. In element В, 
ф; and ф;. ; are given by (5.46) and (5.49). 

The linearly interpolated solution (5.51) is plotted in Fig. 5.9. For a two-element 
division of the interval 0<x € 1 the interpolated solution is clearly rather 
inaccurate at the element midpoints, e.g. at 


х= 0.25, y= 2.63099, у, = 2.35356 , 
х= 0.75, ý= 2.08979, у, = 1.85355 . 
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Fig. 5,9 One-dimensional finite element inter- 
polation 















y a (5.50) 
/ quadratic 
Р linear lati \ interpolation 
20 interpolation \ 3 
NN 
10 


0 0.5 -x 10 


However, the accuracy improves rapidly as the number of elements subdividing the 
interval 0 € x € 1 is increased (Table 5.6). The rms error for linear interpolation, 
Table 5.6, is reducing like Ax?, where Ax is the span of each element. This is typical 
of linear interpolation. 


Table 5.6. Finite element interpolation errors for (5.50) 


Linear interpolation Quadratic interpolation 

Number of rms Number of rms 

elements error elements error - 
2 0.18662 1 0.04028 

4 0.04786 2 0.01599 

6 0.02138 3 0.00484 

8 0.01204 4 0.00206 


At this point the reader seeing this material for the first time may prefer to jump 
over quadratic interpolation and two-dimensional interpolation, and to pursue the 
discretisation process and equation solution for the Sturm-Liouville equation 
(Sect. 5.4). 


5.3.2 Quadratic Interpolation 


The use of linear interpolation, as in Figs. 5.8, 9, imposes a constraint on the 
approximate solution that it must vary linearly between the nodal points. We see 
(Table 5.6) that this introduces an error on a finite grid. Estimates for such 
(interpolation) errors are given by Mitchell and Wait (1977, p. 119), which are 
useful when the exact solution is not available. For the same grid spacing we would 
expect a smaller interpolation error if quadratic interpolation is used. 

As a one-dimensional quadratic approximating function, $; (Fig. 5.10) takes the 
values 
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element A element B Fig. 5.10. One-dimensional 


— ^^ M. MN quadratic approximating 


functions 


$j.1 $j Ф js] 





0 
Xj-2 Xil х} Kiel x Xi42 

ф- іогх<х;-;, 
x— xj- х-х;- 

ф;= (ыз) (2221) for xj-2 € x € x, (element А), 
X,—X;- X= X- 
j 1-2 j гі 
х-—х, х-х; 

ф- (==) (z=) for xj € x € xj; (element В), 
Хр Хужа Хр Х]+2 

ф; = 0 for x > х;+2 . 


(5.52) 


(5.53) 


The approximating functions are recognisable as Lagrange interpolation functions. 
Thus ф; = 1 at node j and ф, = 0 at nodes j - 2, j — 1, + 1 andj + 2. For quadratic 


interpolation (5.44) takes the following form in element А: 
T= T-29j-2+ G-1 G-1 + G9; , 

and in element B 
T = TO tl. Ojai + Th, 


ie. only three terms contribute in each element. 


For the sample problem (5.50) a suitable quadratic interpolation is 


Yin = У;-2 $j- 300) +у;-19;-1(%) + у;ф(х) in element А, 
Yin = У; P(X) + уу +1 Фу+1(х) + у;+ 2 Фу (х) іп element В, 


where. in element А (x;_. Sx € xj), 
X — Xj-1 X—X 
фм], 

Xj-27 Xj-1/ NXj-27 Xj 

X -—X;-25 х- х, 

Xj-1 77 Xj-2 Xj;-17Xj 

X-— Xj-2 X — Xj-1 
Ф (22202 )( X994, 

Х;- Х;-2 X; — Х;-і 


and similarly for element B (x, S x S xj42). 


(5.54) 


(5.55) 


(5.56) 


(5.57) 
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The case of a single quadratic element spanning the interval 0 € x € 1 is shown 
in Fig. 5.9. The quadratically interpolated solution is clearly more accurate than 
the linearly interpolated solution with the same number of nodes. The effect of grid 
refinement (increasing the number of quadratic elements) is to rapidly reduce the 
rms error (Table 5.6). The rate of reduction shown in Table 5.6 is like Ax? for 
quadratic elements. Consequently quadratic interpolation becomes progressively 
more accurate, compared with linear interpolation, as the grid is refined. 

In principle, cubic and higher-order interpolation are also available, in practice 
it is unusual to use interpolation that is of higher order than quadratic. Although 
higher-order interpolation is more accurate it generates systems of equations that 
have many more non-zero terms than when low-order interpolation is used; 
consequently higher-order interpolation is computationally more expensive, parti- 
cularly in two and three dimensions. Achieving the proper balance between 
accuracy and economy, i.e. in choosing methods that are computationally efficient 
(Sect. 4.5), is one of the more interesting strategic considerations in computational 
fluid dynamics. 

We can expect the finite element method solution error to reduce at the same 
rate as the interpolation error, with grid refinement. Generally the solution error 
will be larger on a given grid, since there is an additional error due to the nodal 
point solution not coinciding with the exact solution. As a rough guide, the use of 
linear approximating functions generates solutions of about the same accuracy as 
produced by second-order finite difference methods and the use of quadratic 
approximating functions provides about the same accuracy as third-order finite 
difference methods. 


5.3.3 Two-Dimensional Interpolation 


The concept of finite elements, from which the name of the method is taken, 
becomes more useful in more than one dimension. Thus in two dimensions, the 
local solution is interpolated, separately, in each of the four elements A, B, C and D, 
which surround the (j, k)-th node (Fig. 5.11). The approximate solution, equivalent 
to (5.44), is expressed conveniently in terms of an element-based coordinate system 
(£, п). For bilinear interpolation the approximate solution can be written 


T- 2, Т,ф(& n) , (5.58) 


where in each element —1 < € < 1, —1 <n < 1 (Fig. 5.11). From (5.58) four terms 
contribute in each element. The approximating function $,(£, у) takes the form 


$(5 п) = 0.25(1 + 6,0) (1. mm . (5.59) 


For example, in element C of Fig. 5.15 
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Fig. 5.11. Global/local nodal numbering 





ф = 0250 (13) , 
$: = 0251+ 801-1), 6.60 
фә — 025(1-- (1 5) , 
$, = 025(01— 5 +n) , 


so that ф, = 1 when č = £, у =n, and $, = 0 at all other nodes. For lines of 
constant č or у the approximating functions, ф, demonstrate the same type of 
variation as the one-dimensional approximating functions (Fig. 5.8). 

А three-dimensional view of a bilinear interpolating function centered at the 
(j, k)-th node is shown іп Fig. 5.12. An examination of (5.58) and Fig. 5.12 indicates 










РФФИ 
d 


Ay 






Fig. 5.12. Bilinear interpolation function 


5.3 Finite Element Method and Interpolation 123 


that T is continuous as element boundaries are crossed; however derivatives of T 
are not. 

Just as in one dimension, two-dimensional (bi)quadratic interpolating functions 
may be defined within a particular element (Fig. 5.13). For Lagrange elements the 
approximate solution, equivalent to (5.44), takes the form 


9 
T= У 1,(5). (5.61) 


1-1 


where the biquadratic Lagrange approximating functions are given by 


corner nodes: Фё, п) = 0.25 5, 6(1-- «епт mm , 
midside nodes (& = 0): — $,(£, m) = 0.5(1— £7) mn(1 +), 
midside nodes (т; = 0): (č, m) = 0.5(1—m?)&&0 68) , 
internal node: oé n) = (1- Е2)(1-п?). 


(5.62) 


The quadratic Lagrange approximating functions are interpolatory as before, i.e. 


ф=1 when €=&, =m, 
фу-0 at all other nodes . 


The form of the approximating function (5.62) can be visualised by considering 
constant values of € or п; then the approximating function resembles a one- 
dimensional approximating function, Fig. 5.10. This follows from the fact that the 
two-dimensional Lagrange approximating functions are just products of the cor- 


Fig. 5.13. Two-dimensional 
quadratic Lagrange elements 





124 5. Weighted Residual Methods 


responding one-dimensional Lagrange approximating functions. This has import- 
ant ramifications for interpolation in more than one dimension, particularly in 
relation to constructing dimensionally split schemes (Sect. 8.3). 

Two-dimensional interpolation on finite elements will be illustrated here for the 
function 


f= [1 — 0.8 cos(0.5nx)] [cos(0.5xy)] , (5.63) 


which is shown in Fig. 5.14 for the domain -1 x €1, -1 Ey & 1. 
This domain can be conveniently split into four elements (Fig. 5.15) over which 
the solution is to be interpolated (bi)linearly in terms of the nine nodal values f; = ў. 
In a particular element, say C, the interpolated solution fin is given by 


[= 1 


where (2, у) is а local element-based coordinate system. For the element С, 







Fig. 5.14. Two-dimensional functio 
J=[1—08cos(0.52x)j Гсов(0.5лу)1 





у=-1.0 
ха-1.0 ха0 х= 1.0 Fig. 5.15. Linear elements in two dimensions 


5.3 Finite Element Method and Interpolation 125 


€=2x—1 and g-22y-1. 


The (bi)linear interpolating functions ф; are given by (5.60) and more generally by 
(5.59). 

The interpolated solution with only four elements is not very accurate. For 
example at 


х-05, у-05, f=045711, f, = 0.375 . 


The complete interpolated solution of (5.63) is shown in Fig. 5.16. However the 
accuracy of the solution increases rapidly as the number of elements spanning the 
domain is increased (Table 5.7). The bilinear interpolation results shown in 
Table 5.7 indicate that the rms error is reducing like 4х? (= Ay’). 

A more accurate interpolation may be obtained using (bi)quadratic inter- 
polation. Each element then has nine nodal values (Fig. 5.13) which are interpolated 
by 


9 
fin = >, filén) , (5.65) 
іші 









(Fc 22 
MO 
22222222226 


Fig. 5.16. (Bi)linear inter- 
polation of (5.63) 


Table 5.7. Errors for two-dimensional finite element interpolation 


Bilinear interpolation Biquadratic interpolation 
Number of rms Number of rms 
elements error elements error 

4 0.08527 -- -- . 
16 0.02199 4 0.01199 
64 0.00557 16 0.00151 


64 0.00019 
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where the interpolating functions ф,(2, у) are given by (5.62). As indicated іп 
Table 5.7 elements with biquadratic interpolation are more accurate than elements 
with bilinear interpolation, for the same number of nodal points. With reference to 
Table 5.7, 16 contiguous linear elements contain the same number of nodal points 
as 4 contiguous quadratic elements. As more quadratically interpolated elements 
are used to span the domain —1 £ x € 1, —1 € y S 1, the rms error reduces like 
Ax? (= Ay’). 

Lagrange approximating functions can be introduced in three dimensions. For 
trilinear interpolation eight-node brick elements replace four-node rectangular 
elements in two dimensions. For triquadratic interpolation, 27-node brick elements 
replace 9-node rectangular elements in two dimensions (Zienkiewicz 1977, p. 171). 
In two and three dimensions, cubic and higher-order interpolation is possible but 
not of much interest due to the large number of nodes that become coupled 
(Fletcher 1984, p. 96). 

It is possible to eliminate the central node from the quadratic rectangular 
element shown in Fig. 5.13; this leads to Serendipity elements and interpolation in 
which no internal nodes are present. Such elements are possible for quadratic or 
higher-order interpolation in any number of dimensions. Serendipity elements are 
described by Zienkiewicz (1977, p. 155). 


5.4 Finite Element Method and the Sturm-Liouville Equation 


In Sect. 5.1.1 the Galerkin method is defined as a member of the class of methods of 
weighted residuals. In introducing the form of the approximate (interpolating) 
solution for the finite element method, Le. (5.44), it is stressed that the nodal values 
are the unknowns and the approximating functions are low-order piecewise 
polynomials. Here the Galerkin finite element method will be applied to the Sturm- 
Liouville equation, to illustrate the mechanics of the method. 


5.4.1 Detailed Formulation 

A simplified Sturm-Liouville equation can be written as 
d^Y 
dx 7.2 


with boundary conditions 


L 
+Y¥=F= -Y a,sin(I — 0.5)nx - (5.66) 
ігі 


с dY 
Ү(0) = 0 and —(1)20. (5.67) 
dx 
For the particular form of F shown in (5.66), the exact solution is 


sin[(1 — 0.5)лх] . (5.68) 


it 


= T—[(-— RET ny? 
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The Galerkin finite element method starts by introducing a trial solution, 
equivalent to (5.44), for the approximate solution Y ав. 


J 
Y= 5, Yl), (5.69) 
j=1 


and ф(х) are linear approximating functions that can be expressed, conveniently, 
in an element-based coordinate system x(£) as (Fig. 5.17): 


2[x — (x;-1 +x,)/2] | 
Ах; ' (5.70) 
2[x — (x; - x, .1)/2] 
Ах; 


in element А, $,(£ = 0.51+6), &= 
іп element В, $4,(£ = 0.51-6, ё&= 


Substitution of (5.69) into (5.5) produces an equation residual 
R=—,+Y-F. (5.71) 


Application of the weighted residual equation (5.5) with W,,(x) = @,,(x) gives 
1 d*Y 
[9,69 а Y-F ix =0. (5,72) 
2 4х 


It may be noted that the integration is made over the domain 0 € x < 1. Since the 
approximate solution is expressed, in (5.69) in terms of nodal values Y,, the highest 
derivative permitted to appear in (5.72) 1s the first (Mitchell and Wait 1977, p. 51). 
Therefore an integration by parts is applied as follows: 


1/ 42ү dy]! аф, 
Ko) |а) EG) 673 


Since Y(0) = 0, no equation need be formed with m = 0 (Fig. 5.17). For т 2 1, 
ф„(0) = 0. At x = 1, dY/dx = 0 therefore (5.73) becomes 


о) HO) - 


ELj lju У 
1449 —_—- 
1-1 | 
^ 
/ 
2 ^ 
a 

EL 1 Yo 
--- x Fig. 5.17. Sturm- 
х«0 x, х, 1-1 Ху“! Liouville problem 
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Substituting (5.74 and 69) into (5.72), and rearranging, gives 


J 1 —d d 1 
p || (== 4% + Pns ax |x, =f bn F dx , (5.75) 


0 


3 


=1,...,J. 


Since all the terms in the integrals are known, (5.75) is really a system of algebraic 
equations that can be written 


BY=G, (5.76) 


where b, ; is an element of B given by 
1 / —do, аф, 
bm. j = К On 4$; + 6.6, us , (5.77) 
0 


Y, is an element of Y and g,, is an element of С, i.e. 


1 


Im = |ФЬҒах. (5.78) 
0 


Equation (5.77) is most easily evaluated by converting the integral to one over 2 for 
each element in turn (Fig. 5.17). Because фи is also given by (5.70) only the elements 
adjacent to the j = mth node produce a non-zero contribution. For node m = J 
only one element contributes to the evaluation of (5.77 and 78) Thus B is 
tridiagonal, i.e. only the following terms are non-zero (1 € j € J — 1): 





1 Ax 
bj j-i “ate , 
1 1 Ах, + Ах; 
Ь,.--1--- LIP e 5.7 
^j CAES 4» 3 3) 
1 Ах; 
Ь = +1 
MUTA 6 7 


and for node m = J 


1 Ах 
by y= + 
1,1-1 Ax, 6^" 
—]1 Ax 
b, , = — +1 
11 dx, 3^" 
(1+1 =0). 


The step sizes, 4x; and Ах, 1, аге shown in Fig. 5.18. 
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element B 


element A 


element B 


element A Yje 
MM 





1-1 і ie ^ 12 
(а) 


Кір. 5.18 а, b. Element and nodal contributions to the discretised equations. (a) Linear interpolation. 
(b) Quadratic interpolation, corner node 


ігі 


At first sight, since F is known from (5.66), (5.78) can be evaluated directly. 
However, in the more general case where F is known but a complicated function of 
x, it is convenient to interpolate F in the same manner as Y, ie. 


J 
F= Y Fd, (5.80) 
j=l 
where the nodal values F, are given directly by (5.66). Substituting (5.80) into (5.78) 
gives 


J 1 - 
„= У Fy bmbj4x . (5.81) 
j=l 0 


For linear approximating functions, (5.81) can be evaluated as (1 € j € J — 1) 


Ах Ах,-- Ах Ах, 


and for т = J, g; = (4x,/6)F,_, +(4x,/3)F, . 
For the special case 4х; = 4xj,, = Ax it is instructive to write out (5.76) in full, 
and divide through by 4х. The result is (1 8j € J — 1) 


Y, .,—-2Y, Y, 1 2 1 
E ENOM LAOA 
1 2 1 
-[()ю-‹+(5)в (а) em 


Clearly the first group of terms coincides with a finite difference representation of 
d? Y/dx?. The second and third terms indicate an evaluation of Y and F which is a 
weighted average of three adjacent nodal values. The weighting (4, 3, 4) is character- 
istic of linear elements applied on a uniform grid. 
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Since the system of equations (5.76) is tridiagonal, an economical solution is 
available using the Thomas algorithm (Sect. 6.2.2). 
Solutions have been obtained with the following values of a, in (5.66): 


а: = 1.0, аз = 0.3, а= - 0.2, 


For equally spaced points the rms error for various values of J is shown in 
Table 5.8. The rms error is defined by 


_ J Ү.— ¥.}? 1/2 
в = | $ S] 
ізі 


a, = — 0.5 , as = 0.1 . 


J 


The reduction in error with reducing 4x shown in Table 5.8 is proportional to 4x?. 
This agrees with the theoretically predicted result for linear interpolation and is 
also the same order as for three-point centered finite difference formulae (e.g. 
Table 7.1). | 


Table 5.8. rms error for the simplified Sturm- 
Liouville equation 


J Ax теи 
6 0.2 7.830 x 1073 
Il 0.1 1.860 x 10-3 
21 0.05 0.447 x 10-3 
41 0,025 0.109 x 10-3 
81 0.0125 0.027 x 10-3 


5.4.2 STURM: Computation of the Sturm-Liouville Equation 


In Sect. 5.4.1 the finite element method, with linear interpolation, is applied to the 
simplified Sturm-Liouville equation (5.66) with boundary conditions given by 
(5.67). In this subsection a computer program, STURM, will be described to obtain 
finite element solutions to (5.66 and 67) with linear and quadratic interpolation. 

For quadratic interpolating functions (5.57), the equations, (5.77 and 81) for bm, j 
and а, have the following form for corner nodes in the interior: 





Бо 4% 

2—2 64x; 15” 

bo, mc 240) 

HM 34x, 15’ 

Б, Қаса) т +40. (5.84) 
Б. 24х31 

H*' 3 ХЫ 5” 

Ь., СОР Ч 

HM*? 6Axj4, 15 ’ 
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and 
AX; 2Ах, 4 
%- (S )n- Cs )n- құрғалы О 
24x; Ах | 
+( ИЛЕ : (5.85) 


The specific form of (5.84 and 85) results from the restrictions x;_, = 0.5(x;_2 + xj) 
and x;, = 0.5(х;+ х; з). These restrictions can be relaxed but this leads to more 
complicated algebraic expressions. 

At the boundary node m = J, (5.77 and 81) take the form 





ем, 
447 бдх, 15 
ЖЕ = -Axy , (5.86) 
b= дү ‚ and 
g= - (=). (шу, (б) | (5.87) 


For equations obtained using quadratic interpolation but centered at midside 
nodes, (5.77, 81) have the form 





ЖЕ - = = (5.88) 
рза ата , апа 
9; = (шн. (pe (a)r . (5.89) 


The system of equations (5.76) is tridiagonal if linear interpolation is used 
and alternating tridiagonal and pentadiagonal if quadratic interpolation is used. 
Efficient modifications of Gauss elimination are available for solving (5.76) for 
either situation and are described in Sects. 6.2.2 and 6.2.3. 

Program STURM applies the Galerkin finite element method to the 
Sturm-Liouville equation (5.66). A listing of STURM is given in Fig. 5.19. The main 
parameters used in STURM are described in Table 5.9. 

STURM reads in (and writes out) the control parameters (lines 11-21). The x 
locations, exact solution and nodal values of the right-hand side functions, F in 
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STURM COMPUTES THE SOLUTION OF THE STURM-LIOUVILLE PROBLEM, 

Р= 0 = 1, Е = -SIG(AL*SIN(L-0,5)*PI*X), USING THE FINITE ELEMENT 
METHOD ON A UNIFORM GRID WITH EITHER LINEAR OR QUADRATIC 
INTERPOLATION. 

THE BANDED SYSTEM IS SOLVED WITH BANFAC, BANSOL (SECTION 6.2). 


an an f& ок» 
с› 0 С) С) Сс) ос 


9 DIMENSION Y(65),YEX(65),X(65),B(5,65),G(65),F(€5) , A(5), YD(65) 
10 C 
11 OPEN(1,FILE-'STURM.DAT') 
12 OPEN (6,FILE-'STURM.OUT?) 
13 READ(1,1)INT,IPR,NY,A 
14 1 FORMAT(3I5,5E10.3) 
15 C 
16 IF(INT .EQ. 1)WRITE(6,2) 
17 IF(INT .EQ. 2)WRITE(6,3) 
18 WRITE(6,4)NX,4 
19 2 FORMAT(' STURM-LIOUVILLE PROBLEM,  FEM:LINEAR INTERPOLATION’) 
20 3 FORMAT(' STURM-LIOUVILLE PROBLEM,  FEM:QUADRATIC INTERPOLATION') 
21 4 FORMAT(^ NX-',I3,' А-!,БЕ10.3,//) 
22 С 
23 PI - 3.1415927 
24 NXP = NX - 1 
25 ANX = NXP 
26 DX = 1./ANX 
27 DXS = DX*DX 
28 C 
29 C SET GRID, EXACT SOLUTION AND RHS(F) 
30 C 
31 pos I= 1,NX 
32 AI=I-1 
33 X(I) = AI*DX 
34 F(I) = 0. 
35 YEX(I) = 
36 DO 5 L= 1,5 
37 AL =L 
38 AL = (AL - 0.5)*РТ 
39 DUM = A(L)*SIN(AL*X(I)) 
40 ЕС) = F(I) - DUM 
41 ҮЕХ(І) = YEX(I) - DUM/(1. - AL*AL) 
42 5 CONTINUE 
43 С 
44 C SET UP ARRAYS, B AND G 
45 C 
46 DO 7 I = 2,NX,INT 
47 IN=I-1 
48 IF(INT .EQ. 2)GOTO 6 
49 B(1,IM) = 0. 
50 B(2,IM) = 1./DXS + 1./6. 
51 B(3,IM) = -2./DXS + 2./3. 
52 B(4,IM) = B(2,IM) 
53 B(5,IM) - 0. 
54 G(IM) = (F(I-1) + 4.*F(I) + Е(І+1))/6. 
55 GOTO 7 Fig. 5.19. Listing of program STURM 


(5.66), are set (lines 28-42). The contributions о В and G in (5.76) are evaluated 
(lines 43-69) and an adjustment made to account for the boundary conditions. The 
banded system of equations (5.76) is solved by a call to BANFAC (line 83) to 
factorise B into L. U form and a call to BANSOL (line 86) to solve the factorised 
system. BANFAC and BANSOL are listed and described in Sect. 6.2.3. The rms 
error between Y, the solution returned by BANSOL, and the exact solution is 
computed and the output generated (lines 87-103). 
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56 6 B(1,IM) = 0. 

57 B(2,IM) = 4.*(1./0Х$ + 0.1)/3. 

58 B(3,IM) = 4.*(-2./DXS + 0.8)/3. 

59 B(4,IM) - B(2,IM) 

60 B(5,IM) > 0. 

61 G(IM) = 0.4e(F(I-1) + 8.*F(I) + Е(ІЗ1))/3. 

62 B(1,I) - 2.*(-0.25/DXS - 0.1)/3. 

63 B(2,I) = 2.*(2./0Х$ + 0.2)/3. 

64 B(3,I) = 2.*(-3.5/DXS + 0.8)/3. 

65 B(4,I) - B(2,I) 

66 B(5,I) * B(1,I) 

67 G(I) = -0.1*(F(I-1)*F(I*3)) + 0.2*(F(I)*F(I*2)) + 0.8*Р(1+1) 
68 G(I) = 2.*G(I)/3. 

69 7 CONTINUE 

тос 

71€ ADJUST FOR BOUNDARY CONDITIONS 

72 C 

73 В(2,1) = 0. 

74 B(1,2) = 0. 

75 B(3,NIP) = 0.5*8(3,ИХР) 

76 B(4, NXP) = 0. 

17 B(5,NXP) = 0. 

78 IF(INT .EQ. 1)G(NXP) = (F(NXP) + 2.*F(NX))/6. 

T9 IF(INT .EQ. 2)G(NXP) = 2.*(-0.1*F(NXP-1) + O.2*F(NXP) + O.4*F(NX)) 
80 C 

81 C SOLVE BANDED SYSTEM OF EQUATIONS 

82 C 

83 CALL BANFAC(B,NXP,INT) 

64 C 

85 C 

86 CALL BANSOL(G,YD,B,NXP,INT) 

87 C 

88 C 

89 C COMPUTE RMS ERROR 

90 C 

91 SUM = 0. 

92 pO 9 I = 2,NX _ 
93 Ү(І) = YD(I-1) 

94 DY = Ү(І) = YEX(I) 

95 SUM = SUM + рҮзру 

96 IF(IPR .EQ. 6)GOTO 9 

97 WRITE(6,8)I,X(I),Y(I), YEX(I) ,DY 

98 8 FÜRMAT(' 1=?,13,'’ X=',F7.5,' Ys',F7.5,' YEX-',F7T.5,' DY=', 
99 1Р7.5) 

100 9 CONTINUE 

101 RMS = SQRT(SUM/ANX) 

102 WRITE(6,10)RMS , ИХ 

103 10 FORMAT(//,' RMSz',E10.3,' ИХ=',І3) 

104 STOP 

105 END Fig. 5.19, (cont.) Listing of program STURM 


Typical output for linear interpolation with 4x=0.125 is shown in Fig. 5.20. 
This solution corresponds to the following choice of coefficients a, in (5.66): 


a, = 1.000 А а. = — 1.000 , аз — 1.000 , а. = — 1.000 А a, = 1.000 


The results shown in Fig. 5.20 indicate that the solution error is generally 
smaller close to a Dirichlet boundary condition than close to a Neumann bound- 
ary condition. 

A comparison of the rms solution errors for linear and quadratic interpolation 
with grid refinement is shown in Table 5.10. 


134 


5. Weighted Residual Methods 


Table 5,9. Parameters used in program STURM 


Parameter 


INT 
IPR 
NX 


BANFAC 
BANSOL 
RMS 


Description 


= 1, linear interpolation; = 2, quadratic interpolation 
= 0, only print rms error; = 1, also print solution 
number of points in the x direction, including x = 0 and x = 1.0 
coefficients, a,, in (5.66) 

nodal values of right-hand side of (5.66) 

exact solution, Я, (5.68) 

B in (5.76) 

G in (5.76) 

see Sect. 6.2.3 

see Sect. 6.2.3 

rms error 





STURM-LIOUYILLE PROBLEM, 


FEM:LINEAR INTERPOLATION 


NX= 9 Аш 

I= 2 X= .12500 
I= 3 X= .25000 
I= 4 X= ,31500 
Ін 5 Ха ,50000 
1= 6 Ха .62500 
I= 7 X= ‚75000 
Ін 8 Ха .87500 
Іс 9 Х=1.00000 
RMS= .380Е-02 


У= 
У= 
У= 
У= 
Үз 
Үш 
Үш 
Үш 


NXa 


.100E*01 -.100Е%01 


.11631 
.22691 
„33732 
44403 
„53796 
.62765 
„11190 
14987 


YEX= .11721 
YEX= .22733 
YEX= .33833 
YEX= .44633 
YEX= .53943 
YEX= .62899 
YEX= .71739 
YEX= .75848 


.100Е»01 -.100E+01 


DY=-.00089 
DY=-.00042 
DY=-.00100 
DY=-.00230 
DY=-.00147 
DY=-.00134 
рї=-,00549 
DY=-.00861 


Noteworthy features of Table 5.10 are that: 


.100Е%01 


Fig. 5.20. Typical output from STURM 


1) Higher-order interpolation (i.e. quadratic) is only slightly more accurate on a 
very coarse grid than low-order (linear) interpolation. 

1) The accuracy of higher-order interpolation increases at a faster rate than low- 
order interpolation as the grid is refined. 
iii) Theoretical rates of convergence are achieved approximately (4х? for linear 


interpolation; Ах? for quadratic interpolation). 


Table 5.10. Solution errors for the Sturm-Liouville problem 





Grid size, 
Ах 


1/8 

1/12 
1/16 
1/20 


rms error 


Linear interpolation Quadratic interpolation 


0.00380 0.00148 

0.00163 0.000178 
0.000904 0.000056 
0.000562 0.000023 
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It may be recalled that for the finite element method the approximating 
functions and weight functions are non-zero only in the small region surrounding 
the particular node. Consequently the finite element method is a local method. As 
with the finite difference method the (algebraic) equations generated by the finite 
element method connect together nodal values in a small region only. However, the 
number of connected nodes for the finite element method in multidimensions is 
considerably greater than for the finite difference method (Sect. 8.3 and Fletcher, 
1984). 


5,5 Further Applications of the Finite Element Method 


In this section the finite element method is applied to the diffusion equation 
(Sect. 5.5.1), the Poisson equation (Sect. 5.5.2) and Laplace’s equation (Sect. 5.5.3). 
The first problem illustrates the usual finite element practice of discretising only the 
spatial terms. The time derivative term is discretised as a separate step. The second 
problem introduces two spatial coordinates and, as a result, a more complicated 
structure in the discretised equations, The third problem outlines the isoparametric 
formulation which is useful if the computational domain is irregular. 


5.5.1 Diffusion Equation 


In this subsection the Galerkin finite element method is applied to the one- 
dimensional diffusion (heat conduction) equation 


oT | OT 

à 452 =0, (5.90) 
over the interval O€x < 1 and t20. Initial conditions of the form T(x, 0)=Т,(х) 
and boundary conditions T(0, t) = a and T(1, t) 2 аге assumed. 

Introducing the same linear approximating functions (5.70) as used for the 
Sturm-Liouville problem and applying the Galerkin finite element method in the 
same way produces the following result for internal nodes, on a uniform grid: 


ІГат 2[ar] Пат aT) 4-20, +T 41) _ | 
ЕЛЕНЕ КЫ 


For the Dirichlet boundary conditions indicated above по discretised equations 
are required at the boundary nodes. 

Since the finite element formulation is applied in space only, the treatment of 
the term дТ/ДІ in (5.90) is the same as the treatment of the undifferentiated (in 
Space) term Y in (5.66). Not surprisingly the same (1/6, 2/3, 1/6) weighting occurs in 
both (5.91) and in (5.83). If dT/dt is replaced by 47"*!/4t and the right-hand side of 
(5.91) is evaluated as a weighted average between the nth and the (n+1)-th time 
levels, as in (7.24), the result is 
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1 AT? 4 2 АТ + 1 | 1 А Т 
6\ At 3\ At 6\ At 
[CATT] +T ha) ВОТ - 2T] +T A) 
Ax? Ax? ° 











(5.92) 


where р is a parameter controlling the degree of implicitness. Equation (5.92) can be 
written compactly as 


M АТ?! 
UAI all -ALT + Bing Тр =0, (593) 
where 477*!  T?* ! — T? and the mass and difference operators, M, and Г, 
respectively, are given by 


М,=(1, 3,4) and L,-(l/Ax?, —2/Ax?, 1/Ах2). 


A comparison with the implicit finite difference scheme (7.24) indicates that the 
only difference is the spreading of the time derivative term over adjacent nodes in 
the finite element formulation. Equation (7.24) can be put in the same form as (5.93) 
if the following mass operator is defined: 


M£4=(0, 1,0). 


The similarity in structure of (5.93) and the equation it is created from, (5.90), 
suggests that the finite element method can be interpreted as a term-by-term 
discretisation process like the finite difference method. This interpretation is 
examined in Appendix А.2 and used in Sect. 8.3. 

Equation (5.93) can be rearranged to give the implicit algorithm 


(M, — AtaBL,,) T^*  -[M, + Ats(1 — ВЕТ". (5.94) 
] 1 


The significance of the structure of (5.94) may be better appreciated after reading 
Sect. 7.2. Repeated application of (5.94) centred at each node produces a tridiagonal 
system of equations which can be solved using the Thomas algorithm (Sect. 6.2.2). 

It will be seen that for the finite difference scheme, (7.24), setting В = 0 generates 
an explicit algorithm. This is not the case for the present scheme due to the 
‘implicit’ nature of M,. However, due to the symmetric nature of operators M, and 
L,, it is possible to obtain an explicit scheme from (5.94) for the particular choice 
At = Ax?/(6a p). 

Equation (5.94) is consistent with (5.90) with a truncation error of O(At?, Ax?) if 
В-0.5 and unconditionally stable if f => 0.5. For the particular choice f =0.5, (5.94) 
has been applied to the example considered in Sect. 7.2.5 and the results are 
presented in Table 7.6. 
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5.5.2 DUCT: Viscous Flow іп a Rectangular Duct 


Here the finite element method will be used to discretise a representative two- 
dimensional problem. Fully developed viscous (laminar) flow in a rectangular duct 
(Fig. 5.21) is governed by the z momentum equation 


др aw 0% 
Ф _ (Ss + as) | (5.95) 


where the axial velocity W(x, у) is to be determined for a given pressure gradient 
др/д2 and viscosity p. 
It is computationally advantageous to nondimensionalise (5.95) by introducing 





x y - - и 
Xna=— , ум > Wna 7 Ма [ yp . (5.96) 
д2 
Equation (5.95) becomes (dropping the subscript nd) 
БҮ? 02% 
-| —~4+——41= 97 
(5 mrt aa 81-0 (597) 


with boundary conditions 
W=0 at х=+1, у=+1. (5.98) 


Following a similar procedure as for the Sturm-Liouville equation, Sect. 5.4.1, а 
trial solution is introduced for w as 


I 
w= È widilr ») , (5.99) 
= 1 


where ф,(х, y) are two-dimensional interpolating functions, typically given by the 
equivalent of (5.60) ог (5.62). For the present problem (bi)linear interpolation will 
be introduced over individual elements, as in Fig. 5.15. 

Substitution of (5.99) into (5.97), application of the Galerkin formulation, (5.5, 
10), and some manipulation produces a system of algebraic equations 


BW-G, (5.100) 





Fig. 5.21. Viscous flow in a rectangular duct 
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where W contains the unknown nodal values у; in (5.99), an element of B is given by 


_ ft ЕГУ; дф, | 00; дф 
ac] ШЕ 2% Óx By ду ШЫ ау, (5.101) 





and an element of G is given by 


1 1 
= | f ф.14хаӣу. (5.102) 
-1-1 
The parameter m ranges over all the internal nodes, ie. 2€ j = МХ —1, 2=К< NY 
— 1. The integrals in (5.101) are best evaluated over each element, individually, 
using element-based coordinates (4, ў) as in (5.64). Only the four contiguous 
rectangular elements surrounding node m make nonzero contributions to the 
global integrals in (5.101 and 102). 

The structure of (5.101 and 102) is similar to that of (5.77 and 78), except that 
now the integration is over two dimensions instead of one. The form of the 
interpolating functions ¢,(x, y) is such that contributions to the integration can be 
determined as products of one-dimensional integrations (Appendix A.2). 

For a uniform grid (5.100) takes a particularly concise form 


b 2 
| B M,® Lyx $M,® Loy na -1, (5.103) 


where the subscripts j, К are part of the global grid, Fig. 5.11. In (5.103) L,, and L,, 
are directional difference operators 
=(1/dx?, —2/Ax?, l/Ax?) , 
І,,-(1/4у2, —2/Ay?, lJAy?? , (5.104) 
M, and M, are directional mass operators 
M, -МІ-(%% ‚ $) (5.105) 
and © is the tensor product. The operators have the following meaning: 


L _Wj-1,k 2М Wy auk 

нат 20. 
— 2 1 

Муи; к= ёму Wk иу and 


M,@L yam шығ Prag tae] p afina atwa 
y xx " 6 


Ax? 3 Ax? 
spon m ien (5.106) 


The construction of M, © L,,w; x follows a similar path. 


уу №}, 
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An equivalent three-point finite difference discretisation of (5.97) is 


bw. a L—2Ww; 4, +w; Wik-1—2W;k tW; 
Ө мела УЫ Жа 1, (5.107) 


which can be written in the form of (5.103) by defining 
M, з= МТ eg = (©, 1, 0) . 


The individual mass and difference operators appearing in (5.103) are identical 
with the operators appearing in (5.93). What is different is that the integration over 
two dimensions has introduced the (tensor) product of one-dimensional operators. 
Use is made of the above difference and mass operator notation in Sect. 8.3. This 
notation is discussed more fully in Appendix A.2. The significance of the notation is 
that it permits a term-by-term finite element discretisation 1 in a manner equivalent 
to that of the finite difference method. 

The system of equations formed by applying (5.103) at each internal node ( j = 2, 
J—1; k=2, K—1) is solved very conveniently using SOR (Sect. 6.3), in a manner 
like that used in Sect. 5.2.3. To fit in with the listing of program DUCT (Fig. 5.22), 
the algorithm is written as 


wt, —0.75[1. c (PAR1/6)QV7. іа МУ HW i-a T WIE 0) 
+ PAR2(w p, +11 )+РАВЗ(и" 1 ня, 1 ,)]/PARL , (5108) 


where PAR 1 =(b/a)?/Ax? + 1/Ay? 
PAR2=[ —(b/a)?/Ax? +2/4у?]/3 , . 
PAR3- [2(6/а)2/4х2—1/4у2]/3 . 


Equation (5.108) provides an estimate and (5.109) an improvement, 
whe =W AE wu), _ (5.109) 


where A is the SOR relaxation parameter. 

The above formulation is implemented by program DUCT (Fig. 5.22) to 
provide the velocity distribution in a rectangular duct (Fig. 5.21). DUCT reads in 
(and writes out) the various control parameters (lines 10-21). The exact and initial 
solutions are set up (lines 37-51). Since the number of iterations (SOR) to reach 
convergence is of no interest in this problem the exact solution is used as the initial 
solution for SOR. Subroutine VEL (line 47 and Fig. 5.23) computes the exact 
solution, obtained using the separation of variables method (Sect. 2.5.2), as 


8 2 МЕМ МЕМ (— 1)#*2/2- 1 
#= | — ~—— СО5(0,5ілх)со5(0. siny)) » 
(=) i=1,3,5,... (Gen y4j 
(5.110) 
8\5 NEM NEM 
a) у, (1/{i7j?[(ib/a)? +7}, (5.111) 
П} i21,3,5,... j9 1,3, 5,... 


where the total flow rate q,= || 9 dx dy. 
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1 
2c 

зс DUCT.FOR COMPUTES THE FULLY DEVELOPED VISCOUS LAMINAR FLOW IN A 
4c RECTANGULAR CHANNEL OF ASPECT RATIO, B/A (= BAR), BY FEM (LINEAR 
БС ELEMENTS) AND FDM (3PT CENTERED DIFFERENCES). 
вс SOR IS USED TO OBTAIN ITERATIVE SOLUTION 
7с 

8 DIMENSION ЧЕХ (41,41) ,WA(41,41) 

9c 

10 OPEN(1,FILE="DUCT. DAT? ) 

11 OPEN(6,FILE=*DUCT.OUT’) 

12 READ(1,1)NX,ME, NEM, IPR,ITMX,BAR,EPS, OM 

13 1 FORMAT(5IB,3E10.3) 

14 C 

15 IF(ME .EQ. 1)WRITE(6,2) 

16 IF(ME .EQ. 2)WRITE(6,3) 

17 2 FORMAT(' DUCT FLOW BY FEM: LINEAR ELEMENTS') 
18 3 FORMAT(? DUCT FLOW BY FDM: 3PT СЕЙ. DIFF.') 
19 WRITE(6,4)NX, NEM, BAR, ITMX,EPS,OM 

20 4 FORMAT(’ NXs',I3,' WEM=’,I2,'  B/Az',F5.2,? MAX ITER =',14, 
21 1: CONV TOL = ',E10.3,' OM = ',F6.3,//) 
22€ 

23 РТ = 3.1415927 

24 НН = NX/2 * 1 

25 ШҮ = NX 

26 NXP = NX - 1 

27 WYP = NXP 

28 АЙХ = FIP 

29 DX = 2./ANX 

30 DIS = DX*DI 

31 рү = рх 

32 DYS = DXS 

33 BAS = BAR*BAR/DXS 

34 РАКІ = ВАЗ + 1./DYS 

35 PAR2 = (2./DYS - ВА5)/3. 

36 PAR3 = (2.*BAS - 1./DYS)/3. 

37 c 

a8 C SET UP EXACT AND INITIAL SOLUTIONS 1 
39 C 

40 DO 7 J s 1,NX 

41 АЈ = 1-1 

42 X = -1. + AJ*DI 

43 DO & K = 1,NY 
44 AK K-71 

45 Y = -1. + AK*DY 

46 С 
47 CALL VEL(X,Y,NEM,PI,BAR,WD,QLX) 

48 C 

49 WEX(J,K) = WD 

50 WA(J,K) = WD s 

51 5 CONTINUE 

52 IF(IPR .EQ. O)GOTO 7 

53 WRITE(6,6) (WEX ( J, K) ,K=1, NY) 

54 6 FORMAT(' WEz*,10F7.4) 

55 7 CONTINUE 

56 WRITE(6,8) 


57 8 FORMAT(/) Fig. 5.22. Listing of program DUCT 
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58 С 
59 С ITERATE USING SOR 

60 С 

61 DO 13 I = 1,ITMX 

62 SUM = 0. 

63 DO 12 J = 2, ЮР 

64 й = 1-1 

65 JP2J*1 

66 DO 11 К = 2,NYP 

67 KK-K-1 

68 КР =К+1 

69 ТЕ(МЕ .EQ. 2)60ТО 9 

70 DUM = 1. + PAR1/6.*(WA(JM,KP) + WACJP,KP) + VA(JM,KM) + WA(JP,KM)) 
71 DUM = DUM + PaR2*(WACJ,KP) + WACJ,KM)) + PARSe(VACJM,K) *WA(JP,K)) 
72 WDD = 0.75*DUM/PAR1 

73 GOTO 10 

74 9 DUM s 1. + BAS*(WA(JM,K) + WACJP,K)) + (WACJ,KM) + WA(J,KP))/DYS 
75 WDD = 0.5*DUM/PAR1 

76 10 DIF = WDD - VA(J,K) 

77 SUM = SUM + DIF*DIF 

78 WA(J,K) = WACJ,K) + OM*DIF 

79 11 CONTINUE 

80 12 CONTINUE 

81 RMS = SQRT(SUM/(ANY-1.)/(ANX-1.)) 

82 IF(RMS .LT. ЕР5)60Т0 15 

83 13 CONTINUE 

84 WRITE (6,14) ITHX RNS 

85 14 FORMAT(! CONVERGENCE NOT ACHIEVED IN',I5,' STEPS’ SX, 

86 1! ITER RMS = ',E12.5) 

87 С 

88 С COMPARE SOLUTION WITH EXACT 

89 С 

90 15 SUM = 0. 

91 QL = 0. 

92 DO 18 J = 2,NXP 

93 DO 16 К = 2,NYP 

94 DIF = ША(Ј,К) ~ WEX(J,K) - 
95 SUM s SUM * DIF*DIF 

96 QL = QL + WA(J,K) 

97 16 CONTINUE 

98 IF(IPR .EQ. 0)GOTO 18 

99 WRITE(6,17) (VA(J,K) ,K-2, WYP) 


100 17 FORMAT(’ WA=',11F7.4) 
101 18 CONTINUE 


102 QL = QL*DX«DY 

103 RMS = SQRT(SUM/(ANY-1.)/(ANX-1.)) 
104 WRITE (6, 19)1I, RMS 

105 WRITE(6, 21 )МА(ИН, ИН) , WEX (NH, ИН) 
106 WRITE (6 ,20)QL,QLE 


107 19 FORMAT(/,’ CONVERGED AFTER ',I3,' STEPS’, 5X,°SOL RMS -',E12.5,//) 
108 20 FORMAT(' COMP. FLOW RATE = ',E12.5,' EXACT FLOW RATE = ',E12.5) 
109 21 FORMAT(' W(C/L)-',Ei2.5,'  JWEX(C/L)s',E12.5,//) 

110 STOP 

111 END 


Fig. 5.22. (cont.) Listing of program DUCT 


DUCT (lines 58-83) evaluates (5.108) and (5.109) iteratively until || wf, — и | 
is less than the tolerance EPS. Subsequently (lines 87-106) the solution is compared 
with the exact solution and written out. The main parameters used by the program 
DUCT are given in Table 5.11. Typical output produced by program DUCT for a 
linear finite element solution obtained on a 6 x 6 grid is shown in Fig. 5.24. 
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1 Fig. 5.23. Listing of subroutine VEL 
2 SUBROUTINE VEL(X,Y,N,PI,BAR,W,QL) 

3c 

ac FOR GIVEN X,Y COMPUTES AXIAL VELOCITY ТИ CHANNEL 
5c AND COMPUTES THE FLOW RATE 

6c 

7 CON = (8./PI/PI)**2 

8 ХАЙ = O.5*PI*I 

9 YAN = 0.БерізҮ 

10 W-0 

11 QL = 

12 DO 2 І = 1,И,2 

13 АІ = І 

14 СХ = С05(АІжХАН) 

15 ро 1 J= 1,8,2 

16 АЈ = ] 

17 СҮ = COS(AJ*YAN) 

18 IJ = (I*J)/2- 1 М 
19 IF(IJ .EQ. O)SIG = 1.0 

20 IF(IJ .GT. 0)SIG = (-1.)**IJ 

21 DEW = AI*AJ*((AI*BAR)**2 + AJ*AJ) 

22 DUM = SIG*CX*CY/DEN 

23 Mz Wo e DUM М 
24 QL = QL + 1./DEN/AJ/al 

25 1 CONTINUE ` 

26 2 CONTINUE 

27 W = CON*W 

28 QL = (16./PI/PI)*CON*QL 

29 RETURN 

30 END 


Table 5.11. Parameters used in program DUCT 








Parameter Description 

NX {= МУ), number of points in the x and у directions 
ME =}, linear FEM; =2, ЗРТ ЕОМ 

NEM number of terms in the equation for the exact solution 
IPR print control parameter 

ITMX maximum number of iterations (SOR) 

EPS tolerance (SOR) 

OM relaxation parameter (SOR), А 

X, Y nondimensional coordinates x, y 

VEL subroutine to compute the exact velocity at (x, y) and the total flow rate 
WEX exact axial velocity, % 

WA computed axial velocity, w 

WDD эў, 

RMS WF — Wy ға» ІМ. еа 

QL computed total flow rate 

QLX exact total flow rate, q, 





The effect of grid refinement for both the finite element and finite difference 
methods is shown in Table 5.12. It is apparent that both methods are converging 
like Ax?, Ay? with the finite difference method producing a slightly smaller rms 
error but a slightly larger error in the flow rate, q;. 


5.5 Further Applications of the Finite Element Method 
DUCT FLOW BY FEM: LINEAR ELEMENTS 
NX» 6 МЕМ=21 В/А= 1.00 MAX ITER = 30 СОНУ TOL = .100E-04 ОМ = 1.500 
WE= .0000 .0000 .0000 .0000 .0000 .0000 
МЕк .0000 .1386 .1926 .1926 .1386 .0000 
МЕ= .0000 .1926 .2750 .2750 .1926 .0000 
МЕн .0000 .1926 .2750 .2150 .1926 .0000 
УЕ= .0000 .1386 .1926 .1926 .1386 .0000 
МЕс .0000 .0000 .0000 .0000 .0000 .0000 
МА= .1453 .1990 .1989 .1453 
ЧА= .1990 .2842 .2842 .1989 
WA= .1989 .2842 .2842 .1989 
WAm .1453 .1989 .1989 .1453 
CONVERGED AFTER 10 STEPS SOL RMS = ,72390Е-02 
W(C/L)= .28421E+00 WEX(C/L)= .27502Е%00 
COMP. FLOW RATE =  .52952E*00 EXACT FLOW RATE = .56227Е%00 
Fig. 5.24. Typical output from DUCT 
Table 5.12, Accuracy of DUCT solutions with grid refinement (b/a= 1) 
Grid 6x6 11х11 21х21 Ехасі 
Method rms 4! rms 4! rms qı qı 
FEM 0.007239 0.5295 0.001571 0.5540 0.000256 0.5598 0.5623 


FDM 0.006480 0.4949 0.001480 0.5446 0.000205 0.5583 0.5623 


5.5.3 Distorted Computational Domains: Isoparametric Formulation 
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When the finite element method first became popular, arbitrarily 1965-1970, 
triangular elements were used extensively because they permitted complicated 
geometric shapes to be represented. Triangular elements and the appropriate 


interpolating formulae are described by Zienkiewicz (1977, p. 164). 


The ability to handle non-uniform and distorted computational domains has 
always been an important feature of the finite element method. However, distorted 
rectangular elements can also be introduced to represent complicated compu- 
tational domains, by exploiting the isoparametric transformation. Rectangular 
elements are usually preferred to triangular elements since they are easier to 
interface with grid generation programs (Chap. 13), particularly in three dimen- 
sions. However, Jameson (1989) describes an effective procedure based оп 
tetrahedral elements for representing the domain external to a complete aircraft. 
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/ Fig. 5.25. Isoparametric mapping 


A typical situation, requiring an isoparametric transformation of rectangular 
elements, is illustrated in Fig. 5.25. The elements shown in the physical space (x, y) 
are distorted. However, these can be readily transformed into uniform elements in 
(2, п) space by defining (for bilinear interpolation) 


me 
ll 
Me 


фФ(6 mx, and у 


1 1= 1 


х= ФИФ, Фу. (5.112) 


where (ху, yj) are the coordinates of the Ith corner node of B in physical space and 
фи (Е, n) аге the same bilinear approximating functions as defined by (5.59). The use 
of the isoparametric formulation affects the evaluation of the weighted residual 
integral (5.5). 

For example, the application of the Galerkin finite element method to the 
problem of solving Laplace’s equation in a distorted domain (Sect. 5.2.3) produces 
a linear system of equations, equivalent to (5.100), i.e. 


BW=G . (5.113) 
An element of B is given by 


209,00, , 042m 


bm j= dx dy , 5114 
4 ШЕ ôx dy х) у (5414) 


where т is the node at which the weight function is centred and j is the contributing 
node. In (5.114) the integration is carried out over the area WXYZ (Fig. 5.4). 
However, direct evaluation of (5.114) in physical space would be impossible to 
generalise to arbitrary domains. 

The philosophy behind the isoparametric formulation is that it is straight- 
forward to carry out the integration (numerically) in the transformed (6, у) space 
where the elements are uniform. To illustrate this, attention will be focussed on the 
first term in (5.114), i.e. 
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ПЕЕ jas ау. (5.115) 


The derivatives 0ф;/дх and дф,/ду аге related to дф,/0Ё and 0%,/0 by 


9$;/05 \ _ дф//дх 
(coon) =U] (sen) i (5.116) 
where the Jacobian [J] is given by 
_ (дх/06 ду/дс 
(ое an) | (5.117) 


Тһе terms іп the Jacobian are evaluated from (5.112). For example 


тар) (2 (5, п)» . (5.118) 


From (5.116) the following explicit expressions for 0$, /дх (and дф„/дх) in (5.115) 
are available: 


САТЫ мө 


Іп (5.115), dx dy=detJ d£ dy. Substitution of (5.119) into (5.115) produces the 
following contribution to bm, ; from each element: 


ЖЕТЕ 
GE е 


Тһе advantage of the isoparametric formulation is that all terms in (5.120) аге 
functions of (£, 7) and the elements are uniform in (Ё, 4) space, as indicated in 
Fig. 5.25. Thus the integration can be carried out, conveniently, using Gauss 
quadrature (Isaacson and Keller 1966, p. 327). The isoparametric formulation is 
described, in greater detail, by Zienkiewicz (1977, p. 188). 


5.6 Spectral Method 


The spectral method uses the same form of approximate solution as the (tra- 
ditional) Galerkin method, (5.1). Like the (traditional) Galerkin method the ap- 
proximating functions and weight functions are non-zero throughout the com- 
putational domain. In this sense the spectral and (traditional) Galerkin methods 
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are global methods (Fletcher 1984). In contrast to the situation for the finite 
element method, the unknown coefficients a,(t) in the approximate solution (5.1) 
cannot be identified with nodal unknowns. 

The most important difference between the spectral method and the (tra- 
ditional) Galerkin method is that the spectral method uses orthogonal functions for 
the approximating and weight functions. The approximating and weight functions 
are orthogonal if 


9, у,29һ-(х,у, z)dxdydz#0 when maj 
=0 whenmzj. 


Fourier series, Legendre polynomials and Chebyshev polynomials are examples of 
orthogonal functions. 

It will be seen in the example to be presented shortly that the use of orthogonal 
approximating and weight functions simplifies the structure of the (ordinary 
differential) equations created by the spectral method. In addition, if the boundary 
conditions are appropriate the spectral method can generate solutions of very high 
accuracy with relatively few terms in the approximate solution, if the exact solution 
is sufficiently smooth. These aspects are discussed by Fletcher (1984, Chap. 5) and 
by Canuto et al. (1987). 


5.6.1 Diffusion Equation 


The mechanics of implementing the spectral method can be most easily understood 
by considering an example. The diffusion equation 

oT | OT 

——g4—.-0 5.121 

Qt ax? ( ) 
іп the computational domain 0$x <1 will be examined so that direct comparison 
with the finite difference and finite element methods will be possible. The following 
boundary and initial conditions are assumed: 


Т(0,)-0, (1, )=1 and T(x, 0)=sin(xx)+x . (5.122) 
The starting point for the spectral method is the introduction of the approxi- 
mate solution 


3 
T=sin(nx)+x+ У а (дѕіп(јтх) , (5.123) 
ј=1 


where а, (t) are the unknown coefficients. The form of (5.123) automatically satisfies 
the boundary and initial conditions. Substitution of (5.123) into (5.121) produces a 
residual - 


da. 
R= > Е + af ina sin jnx) +ап?ѕіп(лх) . 
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Evaluation of the weighted integral, i.e. (5.5) with (5.10), 
1 
{[sin@max)R] Ах-0, gives 
0 


J да; . 2 1 . | . + 
У dr +a( jx)*a; f[sin(mzx)sin( jax)]dx 
ізі 9 


1 
+ ат? | [ѕіп(тлх)ѕіп(ях)] dx=0. (5.124) 
о 
The advantage of using orthogonal functions іп (5.124) is that the first integral is 


non-zero only if j=m and the second integral is non-zero only if m=1. Conse- 
quently (5.124) becomes 


а . 
E +(тл)%а„+г„=0 , т= 1, 24. J , (5.125) 
where 
= an? if m- 
Py = OT ifm=1 , (6.126) 
=0 ifmzl. 
The solution of (5.125) can be written down directly as 
a, =e 7 ent 1 , 
а,-0, m=2,...,J. у 
Equation (5.123) then becomes 
T-sin(zx)e "+x , (5.127) 


which is, in fact, the exact solution of (5.121). This has occurred, fortuitously, 
because of the particular choice of boundary and initial conditions (5.122). A more 
realistic situation is provided if the initial condition is given by 


T (x, 0) = 5x 4х? (5.128) 


and if 5x —4x? replaces sin(xx)+x in the approximate solution (5.123) The 
application of the spectral method with (5.128) is similar to that given above except 
that (5.126) becomes 


16 


Pin , 
т 


т-1,3,5,..., (5.129) 


-0, m=2,4,6,.... 
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If da,/dt in (5.125) is replaced with a forward time difference formula 
(a! — a)/At, the following explicit algorithm can be obtained: 


añt! = а" — At[(xmz)?an 4 r,] . (5.130) 


For this case, (5.130) has been integrated with a step-size 4t = 0.001. The results for 
different J are shown in Table 5.13. Also included in Table 5.13 are the exact 
solution T and an approximate solution Т, that only contains errors associated 
with the numerical integration scheme. The exact solution is obtained by the 
separation of variables technique (Sect. 2.5.2) as 


Т=х+4 у Q/EGQj — т } 4 sin[(2j – Dzx]exp( ^ a[(Qj — тің) . (5.131) 
іғі . 
The approximate solution T, is obtained by assuming a solution of the form 
T,=x+4 у (2/[(2j — Dr] y sin[Qj — D)zx1X;(t) . (5.132) 
ізі 
Substitution into (5.121) then requires the solution of 
5 + De PX;=0 . (5.133) 
This is obtained using the same time differencing as for а, in (5.130) and the same 


time-step, 4t —0.001. Substitution into (5.132) gives the solution for T, shown in 
Table 5.13. 

The results shown in Table 5.13 indicate that T becomes more accurate as J is 
increased. However, it should be remembered that the error between Т and the 
exact solution T comes partly from the approximation inherent in the spectral 


Table 5,13 Spectral solutions, at x= 0.5, of (5.121) 


Approximate solution Т Exact 

------------:-------- Approximate solution 
t J=1 J=3 J=5 solution 7, T 
0.00 1.500 1.500 1.500 1.500 1.500 
0.02 1.314 1.347 1.338 1.340 1.341 
0.04 1.162 1.199 1.191 1.193 1.194 
0.06 1.037 1.075 1.067 1.069 1.071 
0.08 0.935 0.973 0.965 0.967 0.969 
0.10 0.851 0.889 0.881 0.883 0.885 
0.12 0.782 0.820 0.812 0.814 0.816 
0.14 0,725: 0.764 0.755 0.757 0.759 
0.16 0.679 0.717 0.709 0.711 0.713 
0.18 0.641 0.679 0.671 0.673 0.675 


0.20 0.610 0.648 0.640 0.642 0.643 
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method and partly from the error in the time differencing introduced to obtain 
numerical solutions of (5.125). The difference between the two approximate sol- 
utions, Т and Т, is due to the error in the spectral method, only. Clearly the use of 
relatively few unknowns in the approximate solution provides a solution of high 
accuracy. This is typical of spectral methods (Fletcher 1984, Chap. 6). 

Applications of the spectral method are given in Sects. 15.3.3 and 17.1.6. The 
traditional areas of application of the spectral method are in global meteorological 
modelling (Bourke et al. 1977) and in the direct simulation of turbulence (Hussaini 
and Zang 1987). Both of these areas require governing equations that include 
nonlinear terms particularly for the convection. 

In order for the spectral method to generate an efficient computational al- 
gorithm it is necessary to introduce transform techniques to handle the nonlinear 
terms. Conceptually this can be thought of as introducing local nodal unknowns 7) 
in place of the a,(t) coefficients in (5.123) performing the nonlinear products in 
terms of T, and re-evaluating the equivalent a; coefficients. Transform techniques 
are discussed by Fletcher (1984, Chap. 5) and by Canuto et al. (1987). 


5.6.2 Neumann Boundary Conditions 


In Sect. 5.6.1 the spectral method was applied to the diffusion equation (5.121) with 
Dirichlet boundary conditions and this led to explicit ordinary differential equa- 
tions (5.125) for the unknown coefficients а, in the trial solution. 

Here a problem with more difficult boundary conditions will be considered to 
demonstrate that the spectral method often entails additional manipulation, during 
the discretisation stage. Specifically, the diffusion equation (5.121) is to be solved in 
the domain 0<x €1.0, t20, for initial and boundary conditions 


T(x, 0)=3—2x—2x*+2x?3 , (5.134) 


To=- and 7(1,9=10. (5.135) 


Rather than attempting to incorporate the initial and boundary conditions into the 
approximate solution, as in (5.123), a general approximate solution is introduced as 


J 
T (x,t)=bo(t)+ У, Га(0 sin(j2nx) 4 bj(t) cos(j2nx)] . (5.136) 
ізі 


For (5.136) to satisfy the boundary conditions (5.135) the coefficients must be 
related as 


J 
У.а(лй--2, УБ=. (5.137) 


150 5. Weighted Residual Methods 


Using (5.137), a, and b, can be eliminated from (5.136) to give 


T= cos(J2nx)+( 2) sin(J 22x) + Y. sin ІОлх1- G МЕТ 


ізі 


+; b; (cos([j(2nx)] — cos([J(22x)]) . | (5.138) 
120 


Implementation of the Galerkin spectral method (Fletcher 1984, р. 30), requires 
that the weight functions in (5.5) are identical with the functions in (5.138), i.e. 


W,, (x) =sin[m(2zx)] — (5 асое» , 1<т</—1, 
W, (x) = cos[m(2zx)] —cos[J(22x)]-, O<msJ-1. 


(5.139) 


Substitution of (5.138) into the diffusion equation (5.121) and evaluation of (5.5) 
based on (5.139) produces the following systems of ordinary differential equations: 





m + [a(2nm)* a, +7 M ) ДЕ қарауда) - -2аОлт), IlsmsJ-1, 
(5.140) 
т + [а(2лту?]р, + у Le [oJ Mb re , i<msJ-1, 
т (5.141) 
апа 
2db, ^C (db, ml. 2 
i -2, P q, +a) ъ}- a(2nJ) (5.142) 


Equations (5.140) form a system of coupled ordinary differential equations for the a; 
coefficients that are independent of the ordinary differential equations (5.141, 142) 
for the b, coefficients. These two systems are marched in time to provide values of 
a,(t) and b(t) that are substituted into (5.138) to give the computational solution 
Т(х, t). 

However, whatever marching algorithm i is used to advance the aj, b; coefficients 
in time, it will be necessary to undertake an initial factorisation and a matrix 
multiplication at every time-step. Since the current problem is linear and accurate 
solutions can be obtained with relatively few terms in the approximate solution 
(5.136), the additional execution time associated with the coupling is not excessive. 

The starting values, aj, b? are obtained by fitting the approximate solution 
(5.136) to the initial solution (5.134), ie. 


11 1 1 _ 
bo— ; =2(7( (x, 0) 5іп(/2лх)ах , b-2[ T (x, 0) cos( j2nx)dx . 
0 о 


(5.143) 


The numerical solution of this problem is to be sought in Problem 5.17. 
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An alternative procedure called the tau method, which is closely related to the 
Galerkin method, is also available. In the tau method the weight functions are 
chosen to match the approximating functions in (5.136), i.e. 


Инь) sin[m(2zx)] , 1£mzJj-1, 
(5.144) 
Wo = cos[m(2xx)] , 1<т</. 
Substitution of (5.136) into (5.121) and evaluation of (5.5) based on (5.144) produces 
the following system of ordinary differential equations: 
da 


m 20.— <m<J— 
di Ta(2:)a, 20, IsSmsJ-1, (5.145) 


db 
Fr + (0лт)%,-0 ‚ l£msJ. 


The system of equations (5.145) are explicit and have solutions 
а,(Ө-а,(0)6742594) LSmSJ—-1, 


b,(0—b,(0e^*0*?"  1zmzJ, (5.146) 


where а„(0) and b,,(0) are given by (5.143) and a, (t) and ba(t) are given by (5.137). 

For the current problem, use of (5.136, 137 and 144) provides a more efficient 
algorithm than the spectral Galerkin formulation, based on (5.138 and 139). In 
general the tau method handles difficult boundary conditions with more flexibility 
than does the Galerkin spectral method (Sect. 5.6.1). The tau method is described 
by Fletcher (1984, p. 207) and Canuto et al. (1987). 


5.6.3 Pseudospectral Method 


The Galerkin spectral method (Sect. 5.6.1) produces very accurate solutions with 
relatively few unknown coefficients а, in the approximate solution (5.123). How- 
ever, when nonlinear terms are involved the evaluation of products of approximate 
solutions becomes very time-consuming. This lack of economy motivates the use of 
collocation (5.8) instead of a Galerkin formulation. Collocation facilitates seeking 
the solution in terms of nodal unknowns, like the finite difference and finite element 
methods, rather than in terms of the unknown coefficients a; in the approximate 
solution. The explicit use of nodal unknowns also permits boundary conditions to 
be incorporated more efficiently than does the Galerkin spectral method. In the 
literature the collocation spectral method is usually referred to as the pseudospec- 
tral method (Orszag 1971). 

Here the pseudospectral method will be illustrated for the diffusion equation 

ou — Qu 

A 15:70 ; (5.147) 
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with boundary and initial conditions 
#(-10=-1,  u(Lt)21, a(x, 0)=sin(xx)+x . (5.148) 


The time derivative in (5.147) is discretised using a finite difference approximation. 
Using Euler time differencing, as in Sect. 5.6.1, produces the algorithm, 


д? п 

EG At . (5.149) 
ox 

The pseudospectral method requires that (5.149) be satisfied at each collocation 

point (х;) and introduces an approximate solution 


N+1 
u(x, )= У, A)T- (x) (5.150) 

- k=1 
that will allow spatial derivatives to be evaluated. The approximate functions in 
(5.150) are Chebyshev polynomials, defined in the interval — 1 £x <1. Chebyshev 
polynomials permit accurate determination of spatial derivatives and are generally 
preferred to trigonometric functions if u is nonperiodic. Chebyshev polynomials 
and their general use are described by Fox and Parker (1968). Specific recurrence 
relations that arise in conjunction with the spectral and pseudospectral methods 

аге given by Gottlieb and Orszag (1977, рр. 159-161). 


The direct use of (5.150) leads to the following expression for д2и/дх? |, in (5.147): 


д? мі GT, 

с = 2-16) | (5.151) 
дх j k=l дх . 
Thus the steps in the pseudospectral method to advance the solution from time 
level n to n+1 can be expressed as: 


i) given и" evaluate a; from (5.150); 
ii) given аҙ evaluate [д2и/дх?]" from (5.151); 
iii) evaluate uj*" from (5.149). - 


It can be seen that although step (ii) takes place in spectral space, i.e. in terms of 
a,, step (іш) takes place in physical space, ie. in terms of и). Step (i) is the 
transformation from physical space to spectral space. 

The efficiency of the pseudospectral method depends on the economical ex- 
ecution of the three steps. The evaluation of az from (5.150) appears to require the 
solution of a linear system Ta" — и", with T being dense. The direct use of Gauss 
elimination (Sect. 6.2.1) to obtain a" implies O((N + 1)?) operations. Fortunately, 
the use of Chebyshev polynomials evaluated at collocation points x; given by 


x о) ЧЕ? | , (5.152) 
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allows а fast Fourier transform (FFT) to be used to evaluate a", in O((N +1) 
log(N +1) operations. Fast Fourier transforms (Cooley and Tukey 1965) are 
described in more detail by Brigham (1974). 

The direct evaluation of 0?u/Ox? from (5.151) at all grid points x; requires 
O((N + 1)?) operations. However, the use of Chebyshev polynomials makes avail- 
able various recurrence relationships leading to a lower operation count to 
complete step (ii). In particular, spatial derivatives can be defined directly in terms 
of undifferentiated Chebyshev polynomials, i.e. 


ди Ë D 
zo), РТ, 100) (5.153a) 
k=1 
and 
2, N-1 | 
= aT, (x) . (5.1536) 
кеі 


Explicit relationships between 22, a(? and a, іп (5.150) are given by Gottlieb 
and Orszag (1977, p. 160). Specifically, the following recurrence relations permit ail 
the 000), al?) coefficients to be obtained іп O(N) operations 

A =а +2, , | 2Sk«N, 


ap-aQur2kd),, 2<КЕМ-1, 


(5.154) 


апа | 
а? =0.5 каз, а02=0.5а0 pat? 

апа 
Pasah, ар-ар,-0. 


Consequently д2и/дх?|, сап be evaluated using ап FFT in О((М- 1)log(N — 1)) 
operations. Since each step of the pseudospectral algorithm is economical the 
overall algorithm is very efficient. 

Nonlinear terms in the governing equation are handled by the pseudospectral 
method in a particularly economical manner. Consider the inviscid Burgers 


equation (10.2) 
Qu Qu 
L pī = 5.155 
at Ox ( 


Following the same procedure as above, the pseudospectral method consists of 
the following steps to advance the solution from time level n to n+1: 


i) given u; evaluate ах from (5.150); 
ii) given ах evaluate Гди/дх1) from (5.153a, 154); 
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iii) evaluate w} =u5[du/dx]j; 
iv) evaluate u?* = и" —w7At. 

The evaluation of the nonlinear term, step (iii), takes place in physical space, which 
avoids the expensive multiplication of individual coefficients in the expansions such 
as (5.150) and (5.153). 

The concept of transforming between spectral space and physical space using 
the FFT to efficiently evaluate nonlinear terms is also used with the Galerkin 
spectral method (Orszag 1971). A detailed example in a meteorological context is 
provided by Haltiner and Williams (1980, pp. 193-196). 

The pseudospectral method has one disadvantage compared with the Galerkin 
spectral method. In treating nonlinear terms the pseudospectral method allows 
aliasing errors to occur (Fletcher 1984). That is, the product of two series solutions, 
as is implicit in step (iii) above, causes new higher frequencies, k> N+ 1, to arise. 
However, since the collocation process only considers N — 1 discrete internal points 
the high frequencies are not properly accounted for. In practice a slight corruption 
of the lower frequency amplitudes occurs. 

For physical problems with little natural dissipation a time integration can 
become unstable due to the accumulation of aliasing errors producing a nonlinear 
instability. This 1s often a problem in weather prediction. For "engineering" flows 
there is usually sufficient dissipation, for example due to turbulence, that aliasing 
errors can be ignored. In this case the pseudospectral method is generally preferred 
to the Galerkin spectral method because of its greater economy and more effective 
treatment of boundary conditions. 

The Galerkin spectral method (Sect. 5.6.1) obtains the solution entirely in 
spectral space, ie. in terms of the coefficients a; in (5.123). The pseudospectral 
method, as described above, works partly in spectral space and partly in physical 
space, with the FFT used to shift between spectral and physical space. It is possible 
to reinterpret the pseudospectral method so that it resembles a discretisation in 
physical space like the finite difference and finite element methods. The essential 
steps in this interpretation (due to Ku and Hatziavramidis 1984) are described 
below. 

The nodal values uj are related to the spectral coefficients a, through (5.150), 
which can be written collectively as 


и= Та. (5.156) 
The coefficients а" are given explicitly in terms of и” by 
а= Ти, (5.157) 


where ап element of T is 


T,-1(x;) , (5.158) 
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€,=€n41=2 , 6,=1 3 l<i<N+1 . 
Equations (5.153a) evaluated at all grid points x; can be written 


ди 
2-14”. (5.159) 


But a! can be linked to a ма (5.154), 
a0 — GOg , (5.160) 


where G” is an (N +1) x (N +1) matrix with elements 


GW=0 "^ if k21 or Е-І еуеп, (5.161) 
к” , Otherwise . 


The coefficients c, 22, c,=1 for k>1. 
Clearly, from (5.159, 160 and 157) one obtains 


% = IGUa- TG Tu=Gy , (5.162) 


Consideration of a single element of the vector ди/дх indicates that (5.162) 
provides, directly, a discretisation formula for ди/дх|; in terms of all the и, nodal 
values. This may be contrasted with the tridiagonal structure of С“ if a centred 
finite difference expression were used to represent ди/дх|,. 

In an equivalent way, (5.153b) can be written 


Qu 

a= Ta” р (5.163) 
апа 

a? 2G 06402606 mg642G08 . (5.164) 


Elements of G® can be evaluated from (5.161). From (5.163, 164 and 157) 


ди 2 nr, AQ 
= 5--4М а= 1 4и = су H . . 
353 GOa- TGO Tu= GO (5.165) 


The matrices 6 and G® сап be evaluated once and for all and stored. 
Subsequently all operations, e.g. at each time level, are executed in physical space. 
The application of the present construction to the diffusion equation (5.147) 
would replace steps (i) and (ii) with the evaluation of [@?u/@x?]} from (5.165). It may 
be noted that this is an O(N) process whereas the evaluation of steps (i) and (ii) 
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involve two fast Fourier transforms and an O(N) recurrence. Because of the 
overheads involved in applying the FFT the use of (5.165) is competitive for N 
typically less than 100 (Ku et al. 1987). 

For the rather extreme case of N=2 in (5.150) the above pseudospectral 
procedure produces the same discretisation of the diffusion equation (5.147) as the 
FTCS scheme (3.4). 

The explicit matrix interpretation of the pseudospectral method will be con- 
sidered further in Sect. 17.1.6 in relation to computing solutions for incompressible 
viscous flow. More advanced treatments of various aspects of spectral methods 
may be found in Gottlieb and Orszag (1977), Fletcher (1984), Voigt et al. (1984), 
Peyret (1986) and Canuto et al. (1987). 


5.7 Closure 


The method of weighted residuals provides a general framework for comparing 
many approximate methods (Fletcher 1984, pp. 32-39). Here the Galerkin finite 
element, Galerkin spectral and finite volume methods have been considered, in 
more detail. 

Like finite difference methods, both the finite volume and finite element 
methods are local methods. Consequently both methods produce discretised 
equations that are sparse, although not as sparse, typically, as a finite difference 
method. Both the finite element method, through the isoparametric formulation, 
and the finite volume method are suitable for irregular computational domains. 
Both methods can also be applied with generalised coordinates (Chap. 12). 

The finite volume method has the additional advantage of discretising, directly, 
the conservation form (Sect. 11.2) of the governing equations. This implies that the 
discretised equations preserve the conservation laws, allowing for the discretisation 
error. 

By contrast the spectral method is a global method. In practice this makes it a 
more difficult method to implement but does permit approximate solutions of high 
accuracy with relatively few terms in the approximate solution, (5.2). The relative 
merits of the finite difference, finite element and spectral methods are compared by 
Fletcher (1984, Chap. 6). Another powerful global method, orthogonal collocation, 
is compared with the finite difference and finite element methods by Finlayson 
(1980, Chaps. 4-6). 


5.8 Problems 


Weighted Residual Methods: General Formulation (Sect. 5.1) 


51 The equation d?^y/dx? + y 2 (1— (51/6)?]sin(5nx/6), subject to the boundary 
conditions y(0) «0, y(1) 20.5, is to be solved іп the domain 0€ x € 1.0 using 
the following methods of weighted residuals: 


5.2 


5.3 
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(а) Galerkin, 

(b) subdomain (0 £ x $0.5; 0.5 х < 1.0), 

(c) least-squares. 

Assume an approximate solution of the form 


y-a,(x—x?)d- a4(x? — x3) +0.5х3 


Compare the computational solutions with the exact solution y —sin (57x/6). 


With a suitable nondimensionalisation the diffusion or heat conduction 
equation can be written 


28 29. 
ôt ax? 


For initial conditions 6(x, 0) = sin(zx) 4- x and boundary conditions 0(0,4)--0, 
9(1, 2) = 1.0, solutions to the above equation are to be obtained in the 
computational domain 0€ x € 1.0, 0 €t $0.20, using the following methods: 
(a) Galerkin, 

(b) subdomain, 

(c) collocation. 

The following approximate solution should be used 


N 
0—sin(zx)-- x У a,(tx/—x7*!) . 


j=1 


- 


Application of the above methods of weighted residuals to the nondimen- 
sional heat conduction equation produces a system of ordinary differential 
equations for the coefficients a;(t). These equations are marched in time via 
numerical integration. 

Obtain solutions (after writing a computer program) for N —3, 5, 7 for the 
three methods and compare the accuracy and rate of convergence, The exact 
solution for this problem is 


O=sin(nx)e"**+x . 


Viscous flow in a rectangular duct is governed by (5.97) 


b 009 OW, io 

aj o! ду? 
subject to boundary conditions W=0 at x= +1, y= +1. The ‘exact’ solution 
for this problem is given by (5.110), if NEM is large enough. Approximate 
solutions are to be obtained using the 
(a) Galerkin, 
(b) subdomain, 
(c) collocation 
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methods. As an approximate solution, use 
N 
w= Lax 0-y» , 
= 


which is suggested Бу the appropriate form of the limiting one-dimensional 
velocity profiles (b/a 20 and oo) Obtain solutions for N=1, 2 and 3 and 
compare with the finite element solutions obtained in Sect. 5.5.2. Comment on 
the accuracy per nodal unknown in relation to the appropriateness of the 
assumed approximate solution. In this regard include (5.110) with small 
values of NEM, since it is also a Galerkin solution (Fletcher 1984, p. 13). 


Finite Volume Method (Sect. 5.2) 


5.4 


5.5 


5.6 


Obtain solutions to Laplace’s equation in the region shown іп Fig. 5.4, using 
FIVOL, for the following parameter values: 

(a) ry = 0.10, гх = 4.00, гу = 1.00, rz =0.10, 0, x =0, 07, =90 ; 

(b) ry — 0.10, rx — 8.00, Гу = 1.00, rz = 0.10, 0, x =0, 07, —90 . 

For each case obtain results for the three grids, IPMAX =КМАХ =6, 11, 21. 
Compare the accuracy and rate of convergence with the results shown in 


Table 5.5. 
Extend program FIVOL to solve the equation 


дф 2% (% 2%), s 


дх "бу 74 дх? + ду? 


The source term 5 тау be treated in the same way as ĝq/ôt in (5.23). Test the 
program for the case 5 = [cos(26) —sin(20)]/r? by obtaining solutions for the 
same parameter values as used to obtain Table 5.5 and «21. Compare the 
solutions with the exact solution ф = (sin6)/r. The boundary conditions for 
the above choice of S are given by (5.31). 

The finite volume method, as described in Sect. 5.2, is to be applied to the 
situation where a Neumann boundary condition is given on WX, c.g. 


дф 1 


ду WX T wx 
The rest of the formulation is the same and the problem has the same exact 
solution as in Sect. 5.2. 

To obtain equations on WX follow the same procedure as shown in Fig. 5.5 
but use a “half-sized” finite volume centred on WX and only extending into 
the computational domain. The equivalent of (5.34) can be obtained directly, 
and approximated as in FIVOL except that 2ф/0х|»х may be obtained from 
the boundary condition. The term дф/дх|ух can be evaluated conveniently 
from a three-point finite difference approximation since WX coincides with 
the x direction. Make the necessary modifications to FIVOL and test for the 
same conditions as shown in Table 5.5. 
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Finite Element Method and Interpolation (Sect. 5.3) 


5.7 


5.8 


5.9 


Apply linear and quadratic finite element interpolation to the function 


y-(1-sin zx)e ?* 

over the range 0 €x € 1.0. Obtain the interpolated function values for 3, 5, 9 
and 17 equally spaced nodal points spanning the computational range. 
Calculate the rms error based on 20 equally spaced points in the range 
0<х<10, and demonstrate that the theoretically predicted convergence 
rates are achieved. 

For the function considered in Problem 5.7 carry out linear and quadratic 
finite element interpolation on a geometrically growing grid with a growth 
factor of 1.2 for the following cases (OS x € Xma): 

(a) 4x 202, 0.24, 0.288, 0.3456, .. . ; 

(b) 4x 20.1, 0.12, 0.144, . . . ; 

(с) Ах --0.05, 0.06, 0.072, ... . 

Terminate the grid as soon as х > 1.0. For quadratic interpolation adjust each 
midpoint so that x; , =0.5(х;-› + xj) etc. Compute the rms errors based on 
201 equally spaced points and compare with the equally spaced interpokated 
solution for approximately the same number of points spanning the domain. 
Consider the solution behaviour and contributions to the rms error in 
different parts of the computational domain and determine whether an 
interpolated solution of a prescribed accuracy could be obtained more 
economically by selective placement of the nodal points. 

Obtain linear and quadratic interpolated solutions, on finite elements, of the 
function 


)-(1--віплхе” 2*іп(лу) 


in the computational domain O€ x X1, 0 Ey € 1. Compute the rms error on a 
uniform grid five times finer than the finest nodal-point grid. Obtain the rms 
error for the same number of elements as shown in Table 5.7 and compare the 
accuracy and rate of convergence with the results shown in Table 5.7. 


Finite Element Method; Sturm-Liouville Equation (Sect. 5.4) | 


5.10 As indicated in Appendix A2, the contributions to the various one-dimen- 


sional finite element operators are obtained from the formulae 
1 
М,-2210,%;4х А 
1 (аф, 
L,=—|— 
* Ах” dx фах, 


L= 14645, 


Ax! ах dx 45: 
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Show (by evaluation of the integrals) that, for linear elements on a uniform 
grid, the above operators centered at the kth node have the values 


121 0.5 05 1 2 1 
М 22)», Пұт ——, 0, — , L= ор з», 9 - 
* (8 3 4) ( Ах 4) ^ (za Ax? 43) 


5.11 Modify program STURM so that it will cope with boundary conditions of the 
form 


У(0)=а , 99 1) =b Е 
dx 


This will require modification to the evaluation of b,, ; and gm when m= J, 
taking into account all non-zero contributions arising in (5.73). Also some 
adjustment of the tridiagonal system of equations will be necessary prior to 
solution by BANFAC/BANSOL. Test the modifications by solving numeri- 
cally the equation 


dy 25r? 5лх 

—5+y=F=(1-—— —-], 

dx?” ( 36 y 6 ) 

with boundary conditions p(0)= 1, dy/dx(1) = —5z/12 in the range OS x < 1.0. 
This problem has the exact solution y —cos(5zx/6). Obtain solutions using 
both linear and quadratic interpolation for the grid sizes shown in Table 5.10 
and compare the accuracy and convergence rate. 


5.12 Apply the Galerkin finite element method, with linear and quadratic interp- 
olation, to the equation 


with the boundary condition У(0) = 1.0 in the range 0 <x € 1.0. This problem 
has the exact solution. у=ехрх. Solve the implicit equations using 
BANFAC/BANSOL as in STURM. Obtain solutions on a uniform grid of 
increasing refinement until the rms accuracy is approximately equal to those 
shown in Table 5.3. Comment on the relative merit (e.g. accuracy and 
computational efficiency) of using local low-order approximating functions 
(as in the finite element method) with global higher-order approximating 
functions as in the traditional MWR (Sect. 5.1). 


Finite Element Method—Further Applications (Sect. 5.5) 


5.13 Obtain solutions for viscous flow іп a rectangular duct, using program DUCT 
on an 11 x 11 grid, for various values of b/a until the centreline solution across 
the smaller dimension is within 1% rms of the one-dimensional parabolic 
profile, e.g. и= 1.5(1 — y?). 
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5.14 Modify program DUCT to solve Laplace’s equation in the domain 
0<х<1.0, OS yS1.0, subject to the boundary conditions 


№(х,0) =со$(0.5лх) ,  w(Ly)2-0,  w(x,1)—e??"cos(0.5nx) , 
ow 

hid -0. 

зо (0:9) 


This problem has the exact solution № = со$(0.5лх)ехр(0.5лу). When applying 
the finite element method to this problem the evaluation of the Galerkin 
equations centred on the line x=0 will only involve integrations over two 
contiguous elements. For homogeneous Neumann boundary conditions 
(5.101) is still valid since the terms arising from the integration by parts are all 
zero. Demonstrate that evaluation of the Galerkin equations over the two 
contiguous elements makes no difference to LT, but replaces (5.104, 105) with 
the expressions 


1 1 1 1 
Г... = 0, —.—, х= РГР ИО 
хх ( Ах? т) М (0 3 z) 


Obtain numerical solutions to the above problem on 6x6, 11x11 and 
21x21 grids and compare the accuracy of the finite element and finite 
difference solutions. 

5,15 Modify program DUCT to solve the equation 
oj oj sé P 


ax y “ax? Pay?” | 


іп the domain 0€ x < 1.0, 05у< 1.0, subject to the boundary conditions 


$(x,0) =e , ф(1,у) — elite? , 


pO yy =ef, ф(х,)=е вех", 


This combination has the exact solution ф(х, у) = exp(x/ajexp( y/f). For the 
parameter values а= 1.0, 3 = 1.0, obtain solutions with a linear finite element 
formulation and a three-point finite difference formulation on 6 x 6, 11 x 11 
and 21 x 21 grids. Compare the accuracy and convergence rates of the finite 
element and finite difference solutions. 


Spectral Method (Sect. 5.6) 


516 Apply the spectral method to the diffusion equation (as described in 
Sect. 5.6.1) with initial conditions 


T(x,0) = 5x — 4x? 
and Dirichlet boundary conditions 


Т(0,)=0, 7(1,9=1. 
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Demonstrate that the results shown in Table 5.13 can be reproduced. Also 
obtain solutions for J — 7 and 9 and compare with the approximate solution 
T, and exact solution T. 

Apply the spectral method to the diffusion equation with the initial condition 


T (x, 0)=3—2x—2x? + 2x? 
and mixed Dirichlet and Neumann boundary conditions (as in Sect. 5.6.2) 


OT (0, t) 


--2, T(L)-10. 
Ax » Таз 





Obtain numerical solutions with х = 1.0 using J=3, 5 and 7 in (5.136) up to 
t —0.1. Compare the computational solutions with the exact solution 


N 
T-3-2x- Y [2 

=1 \ (27—1)т 
to determine the rate of convergence with J. Confirm numerically that the 


steady-state solution 7,,=3-—2x is achieved by numerically integrating to 
large time. 





3 
) біп(2)-1)лх1е “2-1 


6. Steady Problems 


Many of the examples considered in Chaps. 3-5 have included time as an indepen- 
dent variable and the construction of the algorithms has taken this into account. 
However many problems in fluid dynamics are inherently steady, and the govern- 
ing equations are often elliptic in character (Sect. 2.4). 

As indicated in Sect. 3.1.2 any of the major methods, finite difference, finite 
element, finite volume or spectral can be used to discretise the steady flow 
equations, and will produce systems of equations that can be written 


A(V)V=B, (6.1) 


where V is the vector of unknown nodal values or coefficients in the trial solution 
(5.2). A contains the algebraic coefficients arising from discretisation and, in 
general, may depend on the solution (И) itself. В 15 made up of algebraic coefficients 
associated with discretisation and known values of Г, e.g. that are given by the 
boundary conditions. 

The matrix A(V) is typically sparse (few nonzero elements) and the nonzero 
terms are close to the diagonal, for finite difference, finite element and finite volume 
methods. For the spectral method A(V) will be full (few zero elements). In Fig. 6.1 
the elements p, 4, г, s and t in А are associated with the corresponding nodal 
positions. The elements p and Е associated with nodes (j, k+1) and 
(j, k — 1) are displaced from the diagonal of 4 by N, the number of unknown nodes 
in the x-direction. 

It is possible to construct iterative techniques for the solution of (6.1) as it 
stands. However, it can be more efficient to set up an outer iteration for which (6.1) 
is made linear at each step so that direct methods (Sect. 6.2) can be employed. 
Newton's method is of this type and is considered in Sect. 6.1. 


хх 
| | | ххх x 
p К+1 оо 
| Фо è 
—q-r—s—k qrs t 
! ket x ххх х 
| 742. 
j! j i*! x XXX 
x хх 


Fig. 6.1. Typical nonzero element pattern in A after a finite difference discretisation 
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If A does not depend on V only one step of the outer iteration is required, i.e. 
(6.1) can be written 


AV=B. (6.2) 


Direct methods (Sect. 6.2) or iterative methods (Sect. 6.3) may be used to solve (6.2). 
Subroutines FACT and SOLVE are provided in Sect. 6.2.1 to carry out the direct 
solution of AV = B, when А is full, using Gauss elimination. When А is tridiagonal 
or alternating tridiagonal and pentadiagonal, subroutines BANFAC and BAN- 
SOL obtain the solution V also by Gaussian elimination, but much more efficiently 
than subroutines FACT and SOLVE. For certain structures of А, such as that 
produced by a finite difference discretisation of the Poisson equation, special 
procedures are available (Sect. 6.2.6). 

Iterative methods (point Jacobi, Gauss-Seidel and SOR) are illustrated in 
Sect. 6.3.2 for the Poisson equation arising from the viscous duct flow problem 
(Sect. 5.5.2). The influence of grid refinement on the optimum relaxation parameter 
and the convergence rate of point SOR is examined. The extension of point SOR to 
line SOR and ADI is demonstrated and a comparison with point SOR made. 
Conjugate gradient (Sect. 6.3.4) and multigrid (Sect. 6.3.5) methods are described 
primarily as means of accelerating “classical” iterative techniques. 

Some iterative methods, when applied directly to (6.1), are conceptually similar 
to constructing an equivalent unsteady formulation of the original problem. When 
it is recognised that the transient solution is of no particular value it is possible to 
modify the time-dependent terms so that convergence to the steady state is 
achieved with as small an execution time as possible. This is the motivation for 
pseudo-transient methods (Sect. 6.4). 


6.1 Nonlinear Steady Problems 


Steady flow problems are often nonlinear due to the nature of the convective terms 
or, less often, due to the dependence of the flow properties on the solution, e.g. 
viscosity on the temperature, typically. 

Newton’s method in several independent variables is a potentially useful 
technique for solving steady flow problems because of the rapid convergence when 
close to the solution. The method will be illustrated here for two problems. First a 
system of highly nonlinear algebraic equations will be solved. Second the two- 
dimensional steady Burgers’ equations (Sect. 10.4) will be considered, since these 
equations have the same nonlinear structure as the incompressible Navier-Stokes 
equations (Sect. 11.5). 


6.1.1 Newton’s Method 


Newton’s method is a powerful technique for solving nonlinear equations like (6.1). 
This method is shown schematically in Fig. 6.2. 
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Fig. 6.2. Schematic representation of Newton's method 


If (6.1) is written 

R-A(V)V-B-0, (6.3) 
then Newton's method can be written as 

рк YO _(Je)-1R® , (6.4) 


where k is the iteration level and J is the Jacobian. An element of J“ is 


aR” 
ЛЕ. (6.5) 
J дү® 
Equation (6.4) сап be rewritten in the form 
J?Ay**D0o go. (6.6) 


Equation (6.6) is a linear system of equations that is to be solved for the correction 
vector 4V** D — y** 0 y® at each stage of the iteration. 

An attractive feature of Newton's method is that it demonstrates quadratic 
convergence if the current iteration V? is sufficiently close to the converged 
solution V,. Quadratic convergence implies 


ИИ V-I? . (6.7) 


A criterion for the convergence of Newton’s method can be developed as follows 
(after Isaacson and Keller 1966). If J® has an inverse with its norm bounded by а, 
1.е. 


IJO <а, 
and if the first correction vector A И) has a norm bounded by b, i.e. 


lay = -JORO <b , 
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and if R? has continuous second derivatives which satisfy 


N 











for all V in |417“? | «2b, and if the constants satisfy the relation 
abc<05, (6.8) 


then Newton’s method will converge to the solution 


lim V9 =V 


с> 
k= co 


at which R(V.)=0 and || V9 — И, | xb/(2*- !). 


The vector norms are maximum norms, e.g. | V| =max|V;| and the matrix norms аге 
i 


maximum natural norms, e.g. 
N 
|Z] 2 max УЛ. 
і іші 


It should be noted that the evaluation of the inequality (6.8) involves the same 
order of work as attempting to solve the problem itself. The main difficulty with 
Newton’s method is that the radius of convergence, b, decreases as N increases 
(Rheinboldt 1974), so that V must be close to V, to ensure convergence. 

The main contribution to the execution time in using Newton's method is the 
factorisation of J, in the solution of (6.6). It is possible to reduce the execution 
time by freezing the value of J® for a number of steps Ak. That is J® need only be 
factorised once every Ak steps. However more iterations are required, typically, to 
reach the converged solution. Methods for solving nonlinear systems of equations, 
including Newton's method, are discussed by Ortega and Rheinboldt (1970). 


6.1.2 NEWTON: Flat-Plate Collector Temperature Analysis 


For a flat-plate solar collector, shown schematically in Fig. 6.3, an energy balance 
for the absorber and two glass covers might produce the following system of 
nonlinear equations connecting the absolute temperatures of the absorber (Т,) and 
the two glass covers (Т, and Т): 


В, =(Т*+0.06823Т,)-(Т4+0.05848Т.)— 0.01509 =0 , 
R4 -(T44-0.05848T,) — QT 4-0.11696T,) -( T4 4-0.05848T,) -0 , (6.9) 
R,—(T* + 0.05848T;) — (2.05 T4 + 0.2534T,) + 0.06698 =0. 


In (6.9) T,, Т, and T, are absolute temperatures/1000 to make the equations better 
conditioned. 
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glass cover 


Salar radiation | Fig. 6.3. Flat plate solar collector 
| temperature, Тэ 


55 
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Multicomponent Newton's method (Sect. 6.1.1) applied to (6.9) requires the 
solution, at each iteration step (k), of 


Jo4T**D- — RO , © (6.10) 


for AT **'. The solution is then updated by 





glass cover 
temperature, T^ 





absorber 
temperature, T, 





TeV TH, Apter) . 


In (6.10), J;=0R,/0T;. 
For this problem, 


(4T? +0.06823) -(4Т3--0.05848) 0. 
J= | (473+0.05848) —(8Т3+0.11696)  (473+0.05848) |. (6.11) 
0. (4T34-0.05848) —(8.2T3+0.2534) - 


This problem has been incorporated into a computer program, NEWTON. А flow 
chart for NEWTON is shown in Fig.6.4 and the listings of the subroutines 
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Fig. 64. Flow chart for program NEWTON 
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ic : 
2c NEWTON APPLIES NEWTON'S METHOD TO SOLVE A NONLINEAR SYSTEM OF 
3c ALGEBRAIC EQUATIONS, R(T). 
4С RESID EVALUATES THE RESIDUALS 
БС ЈАСОВ EVALUATES THE JACOBIAN 
6c FACT FACTORISES THE JACOSIAN INTO L.U 
7C SOLVE SOLVES THE LINEAR SYSTEM FOR DT 
8c 
9 DOUBLE PRECISION SUM,RMSR,RMST,DSQRT 
10 DIMENSION T(50).4J(50,50).R(50), JPVT(50), TEX(50) 
11€ 
12 OPEN(1,FILE=’ NEWTON. DAT’) 
13 OPEN(6,FILE=’ NEWTON. OUT’) 
14 READ(1,1)N, ITMX,EPS 
15 READ(1,2) (T(J2, J21, W) 
16 READ(1,3) (TEX (J), J^ 1.3) 
17 1 FÜRMAT(2I5,E10.3) 
18 2 FORMAT(10F5.2) 
19 3 FORMAT(7F10.7) 
. 20€ 
21 УВТТЕ (6.4) И, ITMX,EPS 
22 4 FORMAT(’ NEWTONS METHOD FOR И -',I3,' ITMX= 'I3,' EPS= ',E10.3) 
23 WRITE(6,5) (T(J), J=1,0) 
24 5 FORMAT(’ INIT SOLE =’,3F5.2,//) 
25 Мә: 
26 ІТ = 0 
27 6 CONTINUE 
28 C 
29 С CALCULATE RESIDUALS 
30 C 
31 CALL RESID(N,T,R) 
32 € 
33 SUM = 0. 
34 DO7Is1, 
35 7 SUM = SUM + R(I)*R(I) 
36 RMSR = DSQRT(SUM/AXN) 
37 SUM = 0, 
38 pos I= 1," 
39 8 SUM = SUM + (TCI) - TEXCD)**2 
40 RMST = DSQRT(SUM/AN) 
41 WRITE(6,9)RMSR, RMST, (R(J),J21,3) 
42 9 FORMAT(? RMS RHz',D11,4,' RMS T -',D11.4,"? R=’,3D11.4) 
43 IF(RMST .LT. EPS)GOTO 12 
44 IT = IT+1 
45 IF(IT .Е0. ITMX)GOTO 12 
46 C 
47 C CALCULATE JACOBIAN 
48 C 
49 CALL JACOB(N,T,AJ) 
50 C 
51 С FACTORISE ТЕЕ JACOBIAN INTO L.U 
52 С 
53 CALL FACT(N,AJ, JPVT) 
54 C 
55 IF(JPVT(N) .EQ. ~1)WRITE(6,10) 
56 10 FORMAT(! ZERO PIVOT DETECTED’) 
57 ТРОРУТ(И) .EQ. -1)GOTO 15 


Fig. 6.5. Listing of program NEWTON 
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58 С 

59 С SOLVE FOR THE CORRECTION, DT 
60 с 

61 CALL SOLVE(N,AJ,JPVT,R) 
62 C 

63 C INCREMENT T 

64 C 

65 DO 11 I * ти 

66 ОТ = -R(I) 

67 11 T(I) = T(I) + DT 

68 GOTO 6 

69 С 

70 C GENERATE OUTPUT 

71€ 


72 12 WRITE(6,13)IT, RMST 

73 13 FORMAT(//,’ AFTER ',I3,' ITERATIONS THE RMS RESIDUAL IS ',E12.5) 
74 WRITE(6,14) (Т(І),1=1,0) 

75 14 FORMAT(* T = '7Р10.7) 

76 15 CONTINUE 

77 STOP 

78 EFD 


Fig. 6.5, (cont.) Listing of program NEWTON 


indicated in Fig. 6.4 are provided in Figs. 6,5-7. The various parameters used by 
program NEWTON are shown in Table 6.1. The residuals (6.9) are calculated in 
RESID (Fig. 6.6) and a test is made (NEWTON, lines 43-45) to stop if the rms 
residual is below tolerance or if the maximum number of iterations has been 
exceeded. Subroutine JACOB (Fig. 6.7) evaluates the terms in the Jacobian (6.11). 
The linear system (6.10) is solved in two stages. First the Jacobian is factorised into 
LU form by subroutine FACT and subsequently the reduced system 15 solved by 
SOLVE. Subroutines FACT and SOLVE are described in Sect. 6.2.1. - 
The solution of (6.9) by program NEWTON is shown in Fig. 6.8. The solution 
indicates that the temperatures of the absorber surface and two glass covers are 
415 K (142°C), 379 K (106°C) and 334 K (61°C), respectively. The rms error and rms 
residual are showing approximate quadratic convergence, as in (6.7), as the 
converged solution is approached. This example demonstrates how effective 
Newton’s method is for small systems of highly nonlinear algebraic equations. 


1 SUBROUTINE RESID(N,T,R) 

2с 

3c EVALUATES RESIDUALS REQUIRED BY NEWTON'S METHOD 
4c 

5 DIMENSION T(50),R(50),T4(50) 

6c 

7 0111,Ш 

8 DUM = T(J) 

9 1 T4(J) = DUMSDUMSDUMSDUM 

10 С = 0.05848 
11 DUM = T4(2) + C*T(2) 

12 РАМ = T4(3) + C*T(3) 

13 R(1) = T4(1) + 0.06823*Т(1) - DUM - 0.01509 
14 R(2) = ТА(1) + C*T(1) - 2.*DUM + DAM 

15 R(3) = DUM - 2,05*T4(3) - 0.2534*T(3) + 0.06698 
16 RETURN 
17 END 


Fig. 6.6. Listing of subroutine RESID 
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1 SUBROUTINE JACOB(N,T,AJ) 

2с 

зс EVALUATES JACOBIAN REQUIRED BY NEWTON'S METHOD 
4c 

5 DIMENSION Т(50),АЈ(50,50),Т3(50) 
6€ 

7 pO 2I= 1," 

8 D0 1Ј = 1,1 

9 1 АЈ(І,Ј) = 0 

10 руй = Т(1) 

11 T3(I) = 4.*DUM*DUM*DUM 

12 2 CONTINUE 

13 C = 0.05848 

14 AJ(1,1) = T3(1) + 0.06823 

15 AJ(1,2) = - T3(2) - С 

16 AJ(2,1) = T3(1) +С 

17 AJ(2,2) = - 2.*T3(2) ~ 0.11696 
18 AJ(2,3) = T3(3) + C 

19 AJ(3,2) = T3(2) +С 

20 AJ(3,3) = - 2.05*Т3(3) - 0.2534 
21 RETURN 

22 END 


Fig. 6.7. Listing of subroutine JACOB 


Table 6.1. Parameters used by the program NEWTON 





Parameter Description 

N number of equations 

IT iteration counter 

ITMX maximum number of iterations 

EPS tolerance on rms solution error 

T temperature, dependent variable 

R equation residuals, (6.9) holds AT on return from SOLVE 

RMSR rms residual 

RMST rms solution error, || T— T Ims 

AJ Jacobian, J 

JPVT index of kth pivot row (in FACT) 
JPVT(N)- —1, if zero pivot occurs 

DT AT, correction to Г 


NEWTONS METHOD FOR N = 3 ITMX= 10 ЕР5: .100Е-04 
INIT SOLN = .30 .30 .30 


RMS RH= „70230-02 RMS 
RMS RH= ,9400р-02 RMS 
RMS RH= „13170-02 RMS 
RMS RHe .23720-04 RMS 
RMS АН= = .5353D-08 RNS 


.8307D-01 R= -.1216D-01 .6971D-09 -.1007D-05 
.2885D-01 R= .1344D-01 .71220-02 .4993р-02 
-3148D-02 R= .1738р-02 .14490-02 .2885р-03 
‚46470-04 R= .29280-04 .28810-04 .44790-06 
.48670-07 R= .6588D-08 .55700-08 „33990-08 


CEECEE) 
инини 


AFTER 4 ITERATIONS THE RMS RESIDUAL IS .486675-07 
T= 4151283 .3794904 .3335792 


Fig. 6.8. Typical output produced by program NEWTON 
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6.1.3 NEWTBU: Two-Dimensional Steady Burgers’ Equations 


The unsteady form of these equations are discussed more fully in Sect. 10.4. Here 
the steady two-dimensional Burgers’ equations 


204,04 1 да дй _0 

ôx ду Ве\дх? ду?) ^" 
200 0001 (08 0%) | (6.12) 
и — —— —— | =——5 — | = 

дх "oy Ке(дх2 ду? , 


with Dirichlet boundary conditions on u and v are to be solved using Newton's 
method after discretisation using three-point centred difference formulae. The 
discretised form of (6.12) can be written 


1 
Ru,,— ики к + v, yLyu; ,— Re (ик Ly uy; x) -0, 


(6.13) 
1 
Rv, p= uj 1,0, 50; 1,0% - Ве (1б ыы а )-0, 
where 
_ Ħj+1,k Hj auk 
La Mx =, (6.14) 
и, к= uj- Lk 2u, y uj. l.k 


Ax? , 


The solution is sought in the domain shown in Fig. 6.9. 
The boundary values of u and v are taken from the exact solution (Fletcher 
1983) 


(6.15) 


й------- апа ф- 


Кеф дх ~ Reġ ôy ' 
where 


p=a,+a,x+a,y+a,xy +a5(e**~ * 4. e 7 97 *9)cos(Ay) (6.16) 
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u2U(X, Ymax} Y* Y(X.Ymax) Fig. 6.9. Computation domain for 


Y *Ymax solution of (6.12) 





о =0(-1,у u о (1) 
v= У,у) У=У(1,у) 
узо 
uzU(x,0), v2¥{x,0} 
Хи-1 | ха} 


and а;,а,, аз, ад, аб, A and хо are chosen to produce different behaviour of the 
exact solution. 

The evaluation of (6.13) at all internal nodes provides 2(N X —2)(N Y — 2) 
equations that must be solved for и), and v; , for all internal nodes. The solution 
process, using Newton's method, can be written as 


"+=, (6.17) 


where 44 is made up of 4и, к, 4v; , and the vector R contains Ru,, and Кр,, 
evaluated at each internal node, The superscript (n) is used here to denote the 
iteration level. The Jacobian J contains terms like 


ORu, ,/Ouj—1,4= —0.5u, ,/Ax — 1/(ReAx?) , 

ORu, ,/0u; , — Q/Re)(1/Ax? + 1/4y?) + (0.5/Ax)(uj. ,. 4 —uj- 1,4) > 
ORu, ,/0u;, ,,, 70.5u; ,/Ax — 1(ReAx?) , 

дКи, ,/0u; ,-, = —0.5v, ,/Ay — 1(ReAy?) , 

ORu; ,/0u; к. 70. 5v, ,/Ay — 1/(ReAy?) , 

ORu, ,/0v; , = (0.5/ А у)(и, к+1 —Wjk-1) - 


Similar terms arise associated with Rv, x. 

The above formulation is implemented in program NEWTBU. Listings of 
NEWTBU, EXBUR to provide the exact solution, RESBU to evaluate (6.13) and 
JACBU to evaluate (6.18) are given in Figs. 6.10-13. Table 6.2 lists the parameters 
used in NEWTBU. 

The equations (6.13) are located in R in the order, 


(6.18) 


Ru; >, Ко, 2, Ки, з, Ко, з, heey Киз мү-1» КӘ; ny-1> Киз 2, Көз.» 4 . 
and іп 4 the velocities аге ordered іп а corresponding manner, 


И2,2» 02,2, М2, з, 02,35. 0-5 И2,мҮ-1» 02, NY 15 Из, 2: 03,25 - + 
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The exact solution of (6.12), given by (6.15) with parameter values 
a, =a,=110.13 А аз=а4=0 › а,-1.0 А А=5 , 
Ке-10, у,х-7/30 and x,=10, 


has been used as the starting solution for uw and b. However, program 
NEWTBU applied on а 5x 5 grid, і.е. 18 unknown и and v nodal values, fails to 
produce a converged solution. To obtain a converged solution it is necessary to 


1 
2с 

зс NEWTBU APPLIES NEWTON'S METHOD TO SOLVE BURGERS’ EQUATIONS 
а с FOR U(X,Y) AND V(X,Y). 

5c EXBUR EVALUATES THE EXACT SOLUTION OF BURGERS! EQUATIONS 
6c RESBU EVALUATES TEE RESIDUALS 

тс JACBU EVALUATES THE JACOBIAN 

8c FACT FACTORISES THE JACOBIAN INTO L.U 

9c SOLVE SOLVES THE LINEAR SYSTEM FOR DT 

10 С IRD CONTROLS READ/WRITE OF STARTING SOLUTION 

11€ IPN IS TEE INCREMENT FOR PRINTING ITERATIVE ВИЗ 

12 С 

13 DOUBLE PRECISION SUM,DSQRT,R,U,Y 

14 DIMENSION U(21,21),Y(21,21),UE(21,21) , VE(21,21), A (5) , R(50) 
15 1,AJ (50,50), JPVT(50) , RD(50) 

16 COMMON DI,DY,RE,NX,NY,DTI 

17 С 

18 OPEX(1,FILE-"NEVTBU.DAT') 

19 OPEN (6,FILE-'NEWTBU,QUT!) 

20 OPEN(7 ,FILE=’NEWTBU. STO’) 

21 READ(1, 1) ИХ, NY, ITMX, IRD,IPN,EPS,RE,OM,DT 

22 READ(1,2) (4(J),J721,5), AL 

23 1 ҒОВИАТ(Б15,4Е10.3) ^ 7 
24 2 FORMAT(5F8,2,F5.2) 

25 С 

26 WXP = NX - 1 

27 ЖҮР = WY - 1 

28 И = (ИХ ~-2)*(NY - 2)*2 

29 WRITE(6,3)N,ITMX,IRD, ТРИ 

30 З FORMAT(' NEWTONS METHOD FOR И =',13,5Х, *ТТМХ=: ‚ТЗ, 

31 1° ІШ-!,12,: IPN-',I2) 

32 WRITE(6 ,4)DT,EPS , ВЕ, ОМ 

33 4 FORMAT(’ DT-',E12.5,' EPS=’,£10.3,' RE=',£10.3, 

34 1: OMz',E10,3) 

35 WRITE (6,5)(a(J),J=1,5),AL 

36 5 FORMAT(? A z*,5F8.2,'  AL-',F5.2,//) 

arc 

38 C CALCULATE EXACT SOLUTION 

39 C 
40 CALL EXBUR(UE,VE,A,AL) 
41 С 
42 С GENERATE INITIAL SOLUTION 
43 C 

44 IF(IRD .LT. 2)GOTO 8 
45 C 
46 C IRD = 2 OBTAIN STARTING SOLUTION FROM NEWTBU.STO (UNIT 7) 
47 C 
48 DO 7 K = 1,JY 
49 READ(T,6) (U(K,J),Jz1,WX) 
50 READ(T,6) (V(K,J),J-1,WX) 
Бі 6 FORMAT(10F8.5) 
52 7 CONTINUE 

53 GOTO 11 


54 C Fig. 6.10. Listing of program NEWTBU 
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55 8 00 10 J = 1, их 
56 DO 9 K = 1, у 
57 UCK,J) = UE(K,J) 
58 V(K,J) = VE(K,J) 


59 9 CONTINUE 
60 10 CONTINUE 


62 11 DO 12 К = 1, У 

63 12 WRITE(6,13) (UE(K, J) ,J21, WX) 
64 13 FORMAT(! UE=’,7F10.4) 

65 DO 14 K = 1,JY 

66 14 WRITE(6,15)(VE(K,J), Ј=1,ИХ) 
67 15 FORMAT(’ VE=’,7F10,4) 


68 С 

69 АК = N 

70 ІТ = 0 

71 ITP = 0 

72 ОМН = ОМ 

73 DTI = 1./DT 

74 16 CONTINUE 

75 С 

76 С CALCULATE RESIDUALS 
77 с 

78 CALL RESBU(U,V,R) 
79 C 

80 SUM = 0, 

81 DO 17 I= 1," 

82 RD(I) = RCI) 


83 17 SUM = SUM + R(I)*R(I) 

64 18 RMS = DSQRT(SUM/AN) 

85 IF(IT .СЕ. ITP)WRITE(6,19)1T,RMS,(RC(J) , J=1,3) 
86 19 FORMAT(’ IT=',I3,’ RMS=',D11.4,’ Rz',3D11.4) 


87 C 

88 C TEST FOR CONVERGENCE OR MAXIMUM NUMBER 
89 C OF ITERATIONS EXCEEDED 

90 C 

91 IF(RMS .ІТ. EPS)GOTO 23 

92 IF(IT .GE. ITP)ITP = ITP + ТРИ 
93 IT = IT*1 

94 IF(IT .EQ. ITMX)GOTO 23 

95 IF(RMS .СТ. 1.0E*03)GOTO 30 

96 C 

97 C CALCULATE JACOBIAN 

98 C 

99 CALL JACBU(U,V,AJ) 

100 C 

101 C FACTORISE THE JACOBIAN INTO L.U 
102 C 

103 CALL РАСТ(И, АЈ, ЈРУТ) 

104 С t 
105 IF(JPVT(N) .EQ. -1)WRITE(6,20) 
106 20 FORMAT(' ZERO PIVOT DETECTED’) 
107 IF(JPVT(N) .EQ, -1)GOTO 30 

108 С 

109 С SOLVE FOR THE CORRECTION, DT 
110 С 
111 CALL SOLVE(N, АЈ, JPVT, RD) 
112 C 

113 C INCREMENT U AND V 

114 C 


Fig. 6.10. (cont.) Listing of program NEWTBU 





115 DO 22 J = 2, МХР 
116 DO 21 К = 2, МР , 
117 MB = 2*(К-2) + 2%(ИҮ-2)%(7-2) 
118 DU = - RDCMB+1) 
119 DV = - RD(MB+2) 
120 U(K,J) = U(K,J) + DUsOM 
121 V(K,J) = V(K,J) + DV*OM 
122 21 CONTINUE 
123 22 CONTINUE 
124 GOTO 16 
125 С 
126 С GENERATE OUTPUT 
127 С 
128 23 WRITE(6, 24)IT, RMS 
129 24 FORMAT(//,* AFTER ',I3,' ITERATIONS TEE RMS RESIDUAL IS ', 
130 1012.5) 
131 DO 25 К = 1,NY 
132 25 WRITE(6,26)(UCK,J) ,J=1, ИХ) 
133 26 РОВМАТ(’ Uz',7F10.4) 
134 DO 27 К = 1,8Ү 
135 27 WRITE(6, 28) (У(К,Ј), 2=1, их) 
136 28 FORMAT(’ V=’,7F10.4) 
137 С 
138 IF(IRD .EQ. 026070 30 
139 DO 29 К = 1,NY 
140 WRITE(7 ,6)(UCK, J), Jz1, NY) 
141 WRITE (7,6) (УСК, 7), Ј=1,ИХ) 
142 29 CONTINUE 
143 30 CONTINUE 
144 STOP 
145 END Fig. 6.10. (cont.) Listing of program NEWTBU 
1 
2 SUBROUTINE EXBUR(UE,VE,A , AL) 
3c 
4c CALCULATES THE EXACT SOLUTION OF THE TWO-DIMENSIONAL BURGERS’ 
5c EQUATIONS USING THE COLE-HOPF TRANSFORMATION 
6c 
7 DIMENSION A(5),UE(21,21),VE(21,21) ,PH(21,21) 
8 COMMON DX,DY,RE,NY,NY,DTI 
9 PI - 3.1415927 
10 XZ = 1.0 
11 УМАХ = PI/6./AL 
12 АМ - 1 
13 DY = YMAX/ANY 
14 ANX = WX -1 
15 DX = 2./aNI 
16 DO 2 J = 1,m% 
17 АЈ = Ј-1 
18 X = -1. + AJ*DY 
19 ID = X - IZ 
20 DEX = EXP(AL*XD) + EXP(-AL*XD) 
21 DDX = EXP(AL*XD) - EXP(-AL*XD) 
22 DO 1 K = 1,NY 
23 ak =K-1 
24 У = AK*DY 
25 SY = SIN(AL*Y) 
26 CY = COS(AL*Y) 
27 PE(K,J) = ACL) + A(2)*X + A(3)*Y + A(4)*YX*Y + A(BE)SDEXSCY 
28 РИХ = А(2) + A(4)*Y + A(5)*ALSDDXSCY 
29 PHY = А(3) + A(4)*X - A(5)*AL*DEI*SY 
30 ШЕ(К,)) = - 2.*PHX/PE(K, J)/RE 
31 VE(K,J) = - 2.*PHY/PH(K,J)/RE 
32 1 CONTINUE 
33 2 CONTINUE 
34 RETURN А 
35 END Fig. 6.11, Listing of subroutine EXBUR 
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1 
2 SUBROUTINE RESBU(U,V,R) 

3c 

4c EVALUATES RESIDUALS OF 2D STEADY BURGERS’ EQUATIONS 

БС 

6 DOUBLE PRECISION R,U,V,CX,CY,CCX,CCY,DUM 

T DIMENSION U(21,21),V(21,21),R(50) 

8 COMMON DX,DY,RE,NX,NY,DTI 

9c 

10 NXP = NY -1 

11 ШҮР = ШУ ~ 1 

12 СХ = 0,5/DI 

13 СУ = 0.5/DY 

14 CCX = 1./RE/DX/DX 

15 CCY = 1./RE/DY/DY 

16 ИСТ = 1 

17 € 

18 DO 2 J = 2,NXP 

19 IM=J-1 

20 JP2J*1 

21 DO 1 К = 2,NYP 

22 KM = K- 1 

23 KP = K + 1 

24 DUM = CX*U(K,J) *(U(K, JP)-U(K,JM4)) + CY*V(K,J) «(U(KXP, J)-UCKM,J)) 
25 R(NCT) = DUM - CCX*(U(K, JM)-2, *U(K,J) 4U(K, JP)) - CCY*(U(KX,J) 
26 1- 2.*U(K,J) + U(KP,J)) 

21 DUM = CX«U(K,J)* (V(K, JP) -V(K,JM)) + CY*VCR, J) (V(KP, J) -VCKM, J)) 
28 А(ИСТ+1) = DUM ~ CCI*(V(K,JM) -2.*V(K,J) *VCK, JP)) - CCYS(V(RM, J) 
29 1 - 2.*V(K,J) + V(KP,J)) 

30 ИСТ = ИСТ + 2 
31 1 CONTINUE 

32 2 CONTINUE 

33 C 

34 RETURN 

35 END Fig. 6.12. Listing of subroutine RESBU 


underrelax the correction of 4”. Thus the following equation is used: 
q"* Y= + 049" * 0 (6.19) 


with w=0.15. However, the convergence rate is then quite slow (Fig. 6.14). This 
situation is not unusual in applying Newton's method to the discretised equations 
governing flow problems. Namely that the corrections calculated by Newton's 
method are sufficiently large as to throw the current solution outside of the radius 
of convergence (Sect. 6.1.1), if the corrections are implemented in full. 

Equation (6.19) suggests a related strategy. This is to treat 444%) obtained 
from the solution of (6.17) аз a search direction. For three values of œ, new 
solutions 


4" 0= 4" o Aq" 0) 


are obtained and corresponding residuals А" P? evaluated. For each value о, an 
rms residual КОТ) is calculated and a quadratic dependence of К" on œ 
assumed. Then the value of œ, and hence 447"), is chosen that leads to the 
minimum R,,,, for the given search direction 4447), However, this strategy can be 
computationally expensive if the residuals Ru and Rv are complicated. 
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SUBROUTINE JACBU(U,V,AJ) 

EVALUATES THE JACOBIAN OF THE 2D STEADY BURGERS’ EQUATIONS 
DOUBLE PRECISION U,V,CI,CY,CCI,CCY 

DIMENSION U(21,21),V(21,21),4J(50,50) 

COMMON DX,DY,RE,NX,NY,DTI 


И = (ИХ - 2)+(NY - 2092 


D02J = 1,8 

DO 1 K = 1, 
AJ(K,J) = 0 
CONTINUE 

NXP = WX - 1 

WYP = М - 1 

СХ = 0.5/0Х 

СҮ = 0.5/DY 

CCX = 1./RE/DX/DX 
ССҮ = 1,./RE/DY/DY 
ро 7 J = 2, ЮР 

JM» J-1 
JP=J+1 

DD 6 К = 2,ЖҮР 

KM = K- 1 

KP = K +1 

MB = 2*(K-2) + 2%(ШҮ-2)%(1-2) 
LU = MB * 1 

LV = MB + 2 

ML = MB ~ 2%(ШҮ - 2) 


IF(ML .LT. 0)GOTO 3 
AJ(LU,ML*1) = - CX*«U(K,J) - CCX 


нт 


AJ(LV,ML+2) = - CX«U(K,J) - CCX 

AJ(LU,MB*1) = 2.*(CCX4CCY) + CX*(U(CK,JP)-U(K, J4)) + DTI 

AJ(LU,MB+2) = CY*(U(KP,J) - U(KM,J)) 

AJ(LV,MB*1) = CX*«(V(K,JP) - V(K,JM)) =- 
AJ(LV,MB*2) = 2.*(CCX+CCY) + CY*(V(KP,J) - V(KM,J)) + DTI 


MR = ИВ + 2%(ШҮ-2) 

IF(MR „СТ. W-2)GOTO 4 
AJ(LU,MR*1) = CX*UCK,J) - CCX 
AJ(LV,MR+2) = CX*U(K,J) - CCX 


MK = MB - 2 

IF(MK .LT. O)GOTO 5 
AJ(LU,MK*1) = - CY*V(K,J) - ccY 
AJ(LV,MK*2) = - CY*V(K,J) - CCY 
MT = MB + 2 


IF(MT .GT. #-2)СОТО 6 
AJ(LU,MT+1) = CY*V(K,J) - CCY 
AI(LV ,MT+2) = CY*V(K,J) - ССҮ 
CONTINUE 

CONTINUE 


RETURN 
EFD 
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Table 6.2. Parameters used by program NEWTBU 
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Parameter 


NX, NY 


ITMX 
IRD 


IPN 
EPS 


Description 


number of grid points in the x and y directions 
maximum number of iterations 
=0, 1 generate starting solution from subroutine EXBUR 
1, 2 write final solution to file NEWTBU.STA 

—2 read starting solution from NEWTBU.STA 

print sample of the residuals every IPN iterations 
tolerance on the rms equation residuals 


Reynolds number, Re, (6.13) 


under-relaxation factor, c, (6.19) 
time step, used by pseudotransient version (Sect. 6.4) 
number of equations to be solved 
exact solution, й, ё 
. , а (6.16) 


qa, . 
dependent variables (6.13) 


equation residuals (6.13) 
=R after return from RESBU 
contains corrections, Ag, on return from SOLVE 
Jacobian, J, (6.17) 
corrections to и and v; Ag іп (6.17) 





REWIONS METHOD FOR М = 18 


DT» ,50000Е+03 ЕР$= 
Аш 110.13 110.13 


UE= 
UE= 
СЕ 
UE= 
UE= 
VE= 
VE= 
VE= 
YER 
VE= 
ІТ" 0 
IT= 20 
IT= 40 
IT= 60 
ІТ 80 
1Т=100 


.9990 
.9990 
.9990 
.9989 
.9988 
.0000 
‚1317 
‚4679 
‚4142 
‚5714 
ЕМб= 
RMS* 
RMS* 
RHS= 
RN S= 
RHS = 


.9586 
‚9583 
„9572 
‚9553 
‚9524 
‚0000 
‚1277 
‚2598 
„4010 
.5577 
.1496D400 
.4854D-01 
.5504р-02 
.10240-02 
.1892D-03 
.24990-04 


Үш 
Үш 
Үш 
Үш 
R= 
R= 


ITMX-400 IRD= 1 IPN=20 
-100E+02 OK» 


.100E-04 ВЕ= 


.00 .00 


.4888 
.4863 
‚4786 
.4655 
.4462 
‚0000 
‚0753 
„1515 
„2297 
. 5109 


1.00 


-.0559 
-.0565 
-.0584 
-.0614 
-.0657 
.0000 
.0090 
.0179 
.0266 
‚0349 


-.0991 
-.0991 
-.0991 
-.0992 
-.0992 
‚0000 
‚0012 
‚0023 
‚0034 
‚0045 


.150Е+00 


-.21690+00 -.2850D-01 -.2188D400 
-.84070-02 -.11050-02 -.84830-02 
-.3259D-03 -.4283D-04 -.3289D-03 
-,1264D-04 -.1662D-05 -.1277D-04 
-.4914D-06 -.6470D-07 -.4977D-06 
~.1930р-07 -.2559D-08 -.1981D-07 


AFTER 115 ITERATIONS TEE RMS RESIDUAL IS 


y= 
y= 
Ци 
Ци 
Пи 
Үш 
Үш 
Үш 
Үш 
yz 


.9990 
.9990 
.9990 
.9989 
.9988 
.0000 
.1317 
.2679 
.4142 
.5774 


.9586 
.9605 
.9603 
.9577 
.9524 
.0000 
.1282 
.2605 
.4017 
.5577 


.4888 
.4835 
.4748 
.4625 
.4462 
„0000 
‚0748 
‚1507 
‚2490 
‚3109 


~,0559 
-.0567 
-.0585 
-.0615 
-.0657 
‚0000 
„0090 
‚0179 
‚0266 
‚0349 


.98726D-05 


7.0991 
-.0991 
-.0991 
-.0992 
-.0992 
.0000 
.0012 
‚0023 
‚0034 
‚0045 


Fig. 6.14. Typical output from 
program NEWTBU 
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An alternative, and more effective, strategy to (6.19) is based on combining 
Newton’s method with a pseudotransient formulation (Sect. 6.4). The current 
problem is solved by the pseudotransient Newton’s method in Sect. 6.4.1. 


6.1.4 Quasi-Newton Method 


An alternative approach (often called the quasi- Newton method) to solving sys- 
tems of nonlinear algebraic equations, is to replace (6.4) by 


VEDI VO_gQhHOR® | (6.20) 


where Н is an approximation to (J®)~ ' which is systematically modified at each 
step of the iteration so that it approaches (J'?) ^! as the converged solution is 
approached. The modification of Hf? at each step is considerably more economical 
than the factorisation of J. Equation (6.20) can be written as 


y**vu-c y & — gy gd А (6.21) 


where 20 = H®R®, and can be interpreted as a search direction. The scalar w™ is 
chosen so that К+ 0 is a minimum in the search direction, 29, This last feature 
provides a much larger radius of convergence than the conventional Newton's 
method, when N is large. A typical algorithm, the BFGS formula (Fletcher 1980, 
р. 44), would upgrade / 5%) using (6.21). Subsequently - 


y? = R€*0— RO А 
p zz – нәр А 
В® = 1 /P®Ty® А апа 
He* 1) — He - pr НФую po7 4. py OT Н) 
+ pr + p, FO yl) Р®Р®Т | (6.22) 


Although J is typically sparse, the structure of (6.22) indicates that Н is dense. For 
k=0, H is set equal to the identity matrix 2 

The effectiveness of quasi- Newton methods often depends on J having special 
properties, such as positive definiteness (Jennings 1977a). Thus the applicability to 
fluid flow computational formulations needs to be examined on a case-by-case 
basis. Much of the literature on quasi-Newton methods relates to unconstrained 
minimisation (e.g. Powell 1976, Fletcher 1980). Specific discussions of quasi- 
Newton methods are available in Broyden (1965), Ortega and Rheinboldt (1970) 
and Shanno (1983). A typical application of a quasi- Newton method to a fluid flow 
problem is provided by Engelman et al. (1981). 
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6.2 Direct Methods for Linear Systems 


For elliptic problems that are governed by linear equations, e.g. potential flow (Sect. 
11.3) or at intermediate stages of Newton’s method, it is necessary to solve a linear 
system of algebraic equations 


AV=B (6.23) 


for the components of the vector V where all coefficients in the matrix A and vector 
B are known. Gauss elimination (Dahlquist and Bjorck 1974, pp. 147-159) is the 
preferred technique for solving (6.23) but the efficient implementation of the method 
depends on the structure of A. Three categories of A are worth identifying: 


(a) A contains few or no zero coefficients. A is said to be full or dense. 

(b) A contains many zero coefficients. A is said to be sparse. | 

(с) A contains many zero coefficients and the non-zero coefficients are clustered 
close to the main diagonal. A is said to be sparse and banded. | 


Computer programs based оп the assumption that A is ІШІ are more general 
but also computationally more expensive. Such programs could also be applied to 
sparse and/or banded matrices, but more economical special procedures are 
available for these categories. Whatever the structure of A, Gauss elimination 
requires two stages. First A is factorised into lower triangular, Z, and upper 
triangular, U, factors. Second, the factored form of (6.23) is solved as 


UV-L^B. (6.24) 


Because of the structure of U, (6.24) only involves a back-substitution, once LB 
has been formed. 


6.2.1 FACT/SOLVE: Solution of Dense Systems 


When 4 is dense, all coefficients in А are operated on to form Ё and U. Subroutine 
FACT (Fig. 6.15) performs the factorisation LU- 4. The subsequent back-sub- 
stitution (6.24) is performed by subroutine SOLVE (Fig. 6.16). 

At each step of the factorisation a multiple of the current row k is subtracted 
from each subsequent row i to eliminate v,. The multiplier is a; ,/a, к and а, к is 
called the pivot. If the pivot is zero or very small then Gauss elimination fails or is 
very inaccurate. To avoid this situation partial pivoting is introduced. To eliminate 
в, a search is made of the kth column in the range i=k+ 1, № to find the largest 
pivot a; , (FACT, lines 17—24). If the largest pivot is zero then an error condition is 
flagged [JPVT(N)= — 1] and a return is made to the calling program. 

Assuming the largest pivot is not zero, a row interchange is made (FACT, lines 
39-41) to bring the largest pivot, a; ,, into the position (k, К). For each column j 
(1=К-+1, N) coefficients in the ith row (і-К--1, №) are modified (FACT, lines 
38—46) according to 
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SUBROUTINE FACT(N,A,JPVT) 

FACTORISES А INTO PERMUTED L.U SO THAT PERM*A = LeU 
JPVT(K) GIVES INDEX OF KTH PIVOT ROW 

SETS JPVT(N) = -1 IF ZERO PIVOT OCCURS 


DIMENSION A(50,50), JPVT(50) 


Oo - Q mob wh н 
00000 


Ж = й- 1 
әс 
10 C GAUSS ELIMINATION WITE PARTIAL PIVOTING 
iic 
12 DO 5 К = 1,JM 
13 KP =K+1 
14 С 
15 С SELECT PIVOT 
16 С 
17 ЕК 
18 DO 1 I = KP, 
19 IF(ABS(A(I,K)) „СТ. ABS(A(L,K)))L = I 
20 1 CONTINUE 
21 JPVT(K) = L 
22 S = A(L,K) 
23 ACL,K) = a(K,K) 
24 А(К,К) = 5 
25 С 
26 С CHECK FOR ZERO PIVOT 
27 C 
28 IF(ABS(S) .LT. 1.0E-15)G0TO 6 
29 C 
30 С CALCULATE MULTIPLIERS 
31€ 
32 DO 2 I = KP, 
33 А(Т,К) = -А(І,К)/5 
34 2 CONTINUE 
35 С 
36 С INTERCHANGE AND ELIMINATE BY COLUMNS 
arc . 
38 DO 4 J = КР. и - 
39 S = A(L,J) 
40 A(L,J) = A(K,J) 
41 АСК, J) = 5 
42 IF(ABS(S) 11. 1.0E-15)G0TO 4 
43 DO 3 I = КР,и 
44 A(I,J) = A(I,J) + A(I,K)*S 
45 3 CONTINUE 
46 4 CONTINUE 
47 Б CONTINUE 
48 RETURN 
49 6 ЗРУТ(И) = -1 
50 RETURN 
51 END Fig. 6.15, Listing of subroutine FACT 


а; к 
94, j=41,5-\ 7 Jai - (6.25) 


Arrays in computers are stored by columns (index i in а; у). Therefore it is more 
efficient to make the innermost loop in FACT (lines 43-45) operate on the first 
index. 

Subroutine SOLVE first manipulates В (lines 12-21) to interchange with the 
pivot row and to modify b,, i=k-+1, М by subtracting multiples of the pivot row, 


hb - (B) , (6.26) 
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SUBROUTINE SOLVE(N,A, ЈРУТ, В) 


1 
2c 

зс SOLVES LINEAR SYSTEM, А*Х = B 
4C ASSUMES А IS FACTORISED INTO L.U FORM (ВҮ FACT) 
5c RETURNS SOLUTION, X, IN 8 

6c 

7 DIMENSION A(50,50), JPVT(50).B(50) 
8c 

ec FORWARD ELIMINATION 

10 С 

11 Ш-И-і 

12 DO 2К=1, им 

13 KP = K + 1 

14 І = JPvT(K) 

15 $ = B(L) 

16 BCL) = B(K) 

17 B(K) = 5 

18 DO 1 I = KP,N 

19 B(I) = BCI) + А(Т,К)*5 

20 1 CONTINUE 

21 2 CONTINUE 

22 С 

23 С BACK SUBSTITUTION 

24 С 
25 DO 4 КА = 1, ИМ 
26 KM = И - КА 

27 K= KM + 1 

28 B(K) = BOO/ACK,K) 

29 $ = - В(К) 
30 DO 3 I = 1,KM 
31 В(І) = BCI) + A(I,K)*S 
32 3 CONTINUE 
33 4 CONTINUE 
34 B(1) = B(1)/A(1,1) 
35 RETURN 

36 END Fig. 6.16. Listing of subroutine SOLVE 


corresponding to the equivalent operation (6.25) in FACT. The final solution is 
obtained by SOLVE (lines 25-34) by back-substitution. That is 


by У 
Vy=— and р={Ь— Y Qj 0 |] ài. 


ам, м к-ізі 


To reduce storage the solution V overwrites and is returned аз В. 

Implementing Gauss elimination in two parts, subroutines FACT and SOLVE, 
is useful when solutions for multiple right-hand sides, B in (6.23) are required. This 
is because FACT requires O(N?) operations, whereas SOLVE requires O(N’) 
operations. Subroutines FACT and SOLVE, in slightly fuller form, are discussed at 
greater length by Forsythe et al. (1977, Sect. 3.3). 

Typically spectral methods (Sect. 5.6) and panel methods (Sect. 14.1) produce 
matrices A with a dense structure. In contrast, the finite element and finite 
difference methods produce matrices A that are sparse and, with the use of splitting 
(factorisation) techniques (Sect. 8.2), narrowly banded. 

When A is sparse, but not necessarily banded, Gauss elimination is often 
implemented by storing A as a single array containing the magnitude of a, ; with an 
associated pointer array, ГА, providing information about the location of (i, j). The 
major difficulty in coding sparse Gauss elimination is that fill-in of 4 occurs. That 
is, during the elimination process entries іп 4 that were previously zero become 
non-zero. The different strategies for updating 4 and ГА are discussed by Jennings 
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(1977, Chap. 5), and, at a more advanced level, by Duff (1981), George and Liu 
(1981) and Duff et al. (1986). Packages for implementing sparse Gauss elimination 
are available, e.g. Duff (1981, pp. 1-29). 


6.2.2 Tridiagonal Systems: Thomas Algorithm 


The use of three-point finite difference formulae or finite elements with linear 
interpolation leads, after splitting (Sect. 8.2), to a tridiagonal structure for A in 
(6.23) The use of higher-order finite difference schemes or higher-order finite 
elements produces a larger bandwidth in A. The Thomas algorithm is suitable for 
solving (6.23) when A is tridiagonal. The extension of the Thomas algorithm to the 
case when A is pentadiagonal is described in Sect. 6.2.4. For systems of equations, A 
has a block (tridiagonal) structure typically. The treatment of A for this case is 
considered in Sect. 6.2.5. 

When the nonzero elements Пе close to the main diagonal it is useful to consider 
variants of Gauss elimination that take advantage of the banded nature of 4. An 
example is provided by the convection-diffusion problem of Sect. 9.3. Using centred 
difference formulae the following algorithm (in the present notation) is obtained 


—(1 4 0.5Raa)v;- , 20 (1 05R4)0,120 , (6.27) 


which, when repeated for every node, gives 


b, с, 01 4, 
аҙ Б, c; 9 4, - 
а, Б; с; v; = d; ‚ (6.28) 

аң-і Ән-і См-1 ÜN-1 dy- i 

ау by UN dy 


where a;2 —(14+0.5R,,3;), b;22, c= —(1—0.5R,,;,). Nonzero values of d; are 
associated with source terms, or, for d, and dy, with boundary conditions. АП terms 
in A, other than those shown, are zero. The change in notation, particularly in 
relation to b;, from that in Sect. 6.2.1, may be noted. 
The Thomas algorithm for solving (6.28) consists of two parts (Fig. 6.17). 
First (6.28) is manipulated into the form 


А / 
1 C1 01 4, 
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хх v x Fig. 6.17. The Thomas al- 
ххх . х forward gorithm for solving a tridia- 
.. x swee onal system of equations 
ххх “іг [x Р Е sy q 
eee x 
ххх || * х 
хх [ini [x 
Yi i+ vi| |+ 
backward ` 1+ "LI 
sweep jac]  *: М 
< Р i+ oleh 
ae е + 
те |ы [+ 


ie. the а; coefficients have been eliminated and the Б; coefficients normalised to 
unity. For the first equation 


cat, d=% (6.29) 


and for the general equation 


I Ci 
^ bad | 
(6.30) 
та: 
Ба; с: 1 


The equations are modified, as in (6.30), in а forward sweep (Fig. 6.17). Тһе second 
stage consists of a back-substitution (backward sweep in Fig. 6.17), 


vy=dy апа 
9; dj — 9; 16i . ` (6.31) 


The Thomas algorithm is particularly economical; it requires only 5N —4 
operations (multiplications and divisions). But to prevent ill-conditioning (and 
hence round-off contamination) it is necessary that 


Ib > Ja 4- lei] . 


The use of splitting for multidimensional problems (Sect. 8.2) typically generates 
tridiagonal systems of equations that can be solved efficiently using the Thomas 
algorithm. 


6.2.3 BANFAC/BANSOL: Narrowly Banded Gauss Elimination 


When 4 is narrowly banded subroutines BANFAC and BANSOL are suitable for 
performing Gauss elimination. Subroutine BANFAC (Fig. 6.18) carries out the 
forward sweep or repeated forward sweeps to render A upper triangular. Sub- 
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SUBROUTINE BANFAC(B,N,INT) 


FACTORISES BAND MATRIX ARISING FROM LINEAR OR QUADRATIC ELEMENTS 
INTO L.U 


DIMENSION B(5,65) 
IF(INT .EQ. 2)GOTO 2 


NaN Ae UN e 
ооо 


эс 
10 С INT = 1, LINEAR ELEMENTS = TRIDIAGONAL SYSTEM 
11 С 

12 WP = И - 

13 DO 1 J = 1,ИР 

14 ЈР = Ј+1 

15 В(2, ЈР) = В(2, ЈР) /В(3,Ј) 

16 B(3,JP) = B(3,JP) - В(2, ЈР) *B(4,J) 

17 1 CONTINUE 

18 RETURN 

19 C 

20 C ТИТ = 2, QUADRATIC ELEMENTS = PENTADIAGONAL SYSTEM 
21€ ASSUMES FIRST EQUATION FORMED AT MIDSIDE NODE 
22 С 

23 2 ИН = 1/2 

24 00 5Т= 1,2 

25 15- 3-І 

26 DO 4 J= JS,NH 

27 ЈА = 2*(1-1) 

28 IF(I .Е0. 2)GOTO 3 

29 C 

30 C I = 1, FIRST PASS, REDUCE TO TRIDIAGONAL 

31€ 

32 JB = ЈА + 2 

33 B(1,JB) = 8(1,JB)/B(2,JB-1) 

34 8(2,J8) = B(2,JB) - B(1,JB) *B(3, 38-1) 

35 8(3,J8) = B(3,JB) - B(1,JB)*B(4,JB-1) 

36 GOTO 4 

37 C 

38 C I = 2, SECDND PASS, REDUCE TO UPPER TRIANGULAR ~ 
39 C 

40 3 JB = JA * 3 

41 8(2,J8-1) = B(2,J8-1)/8(3,JB-2) 

42 8(3,J8-1) = B(3,JB-1) - B(2,J8-1)*B(4,JB-2) 

43 IF(JB .GT. N)GOTO 4 

44 8(2,J8) = 8(2,JB)/B(3,J8-1) 

45 8(3,28) = B(3,JB) ~ B(2,JB)*B(4,JB-1) 

46 8(4,J8) = B(4,JB) ~ B(2,JB)*B(5, JB-1) 

4T 4 CONTINUE 

48 5 CONTINUE 

49 RETURM 

50 END Fig. 6.18. Listing of subroutine BANFAC 


routine BANSOL (Fig. 6.19) modifies the right-hand side, B in (6.23), and solves the 
resultant system of equations by back-substitution. 

For INT=2, BANFAC and BANSOL have been coded to make use of the 
particular structure of A that is produced by one-dimensional quadratic elements. 
That is, A is alternately “tridiagonal” (three contiguous non-zeros per row) and 
“pentadiagonal” (five contiguous non-zeros per row), corresponding to equations 
associated with midside or corner nodes (Fig. 5.10), respectively. 

For this case the factorisation process (BANFAC) consists of two sweeps. The 
first sweep operates only on the “pentadiagonal” rows to eliminate the extreme 
coefficient а, ,- 2. The second sweep reduces the matrix to upper triangular form. 
The precise implementation depends on whether the first and last equations in 
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1 
2 SUBROUTINE ВАНЗОГ(В,Х,В,Н, INT) 
3c 
4c USES L.U FACTORISATION TO SOLVE FOR X, GIVEN R 
BC 
6 DIMENSION R(65),X(65),B(5,65) 
7 IF(INT .EQ. 2)4070 3 
ac 
өс INT = 1, TRIDIACONAL SYSTEM 
10 С 
11 Wee N- 1 
12 DO 1 J = 1,NP 
13 Іре1і 
14 1 КОР) = А(ЈР) ~ 8(2, JP) R(CJ) 
15 X(N) = R(N)/BC3,N) 
16 DO 2 J = 1,NP 
17 А-Я-:2 
18 ХСА) = (R(JA) - B(4,JA)*X(JA+1))/B(3, JA) 
19 2 CONTINUE 
20 RETURN 
21 С 
22 € INT = 2, PENTADIAGONAL SYSTEM 
23 с ASSUMES FIRST EQUATION FORMED AT MIDSIDE NODE 
24 С SET IBC - O IF LAST EQUATION FORMED AT MIDSIDE NODE 
25 C SET IBC = 1 IF LAST EQUATION FORMED AT CORNER NODE 
26 C 
27 3 ІВС = 1 
28 NH = N/2 
29 IF(2*NH .EQ. Н)В(Н+1) = 0. 
30 IF(2*NH .EQ. N)B(2,N*1) = 0. 
31 2061-1,2 
32 DO 5 J = 1,NH 
33 JA = 2*1 
34 ро4аК = 1,1 
35 JB* JA- 1 * K 
36 4 R(JB) = R(JB) ~ B(I,JB)*RCJB-1) 
37 5 CONTINUE 
38 6 CONTINUE 
39 НЕН = NH - IBC 
40 X(N) = R(GD/B(G,N) 
41 IF(IBC .EQ. 1)Х(И-1) = (В(Н-1) - В(4,Н-1)*Х(Н))/В(3,И-1) 
42 DO 7 J= 1,NEE 
43 ЈА = N - 2%] + 1 - IBC 
44 X(JA) = (ВОА) - B(4,JA)*X(JA*1) ~ B(5,JA) *X(JA*2) )/B(3, JA) 
45 ХСТА-1) = (RCJA-1) ~ B(CA, JA-1) 1(JA) )/B(3, JA-1) 
46 7 CONTINUE 
47 RETURN 
48 END Fig. 619. Listing of subroutine BANSOL 


^ 


(6.23) are associated with midside or corner nodes. Dirichlet boundary conditions 
correspond to midside nodes; Neumann boundary conditions correspond to 
corner nodes. The reader may find it helpful to examine the code relating to the 
option, INT =2, after looking at Sect. 6.2.4. 

An example of the use of BANFAC/BANSOL for both tridiagonal and 
alternating tridiagonal/pentadiagonal matrices is provided by the solution of the 
Sturm-Liouville problem (Sect. 5.4.2). 

А useful test that the subroutines solving (6.23) have been coded correctly is to 
multiply up the left-hand side of (6.23), after V has been obtained, and to ensure 
that the left-hand side is equal to the right-hand side. However, this does not 
guarantee an accurate solution V if A is ill-conditioned. That is if the equations 
forming the system (6.23) are close to linear dependence. Ill-conditioned systems, 
and the effect on solution accuracy, are discussed by Gerald (1978, pp. 106-113) 
and Dahlquist and Bjorck (1974, pp. 174-185). 
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6.2.4 Generalised Thomas Algorithm 


The use of higher-order finite difference and finite element schemes leads to 
bandwidths larger than tridiagonal. However, the Thomas algorithm can be 
generalised without difficulty. Suppose that the following pentadiagonal system is 
to be solved: 


ен ау by Uy dy 


The first stage consists of eliminating all the е, terms to give 


b, с, fi 01 4 


ГА РА Ё 
ау-і by-1 Су-\ 


ГА t 
ау М UN dy 
where 
a; =a;—e;,b;- 118-1, 
b;=b;—e,¢;-1/4;-1 > | 
а=с—еД-1/%-1 > | (6.32) 


fi-f , 


f 
d; — d;— e;d;. , /aj-, . 


The next stage is essentially the same as the first stage of the conventional Thomas 
algorithm. That is, all the a; terms are eliminated to give 
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Га fi 01 4 
+ 
гал 
| cg fi vuv | =| а; |, (6.33) 
1 см, . 
1 Dy dy 
where 
с! – а! и 


= е (6.34) 


= ГА ии 5 
bj — ас. 


The solution of (6.33) then requires a back-substitution of the general form 
v= d; — civi ea Уна. (6.35) 


It is apparent that the pentadiagonal system requires two forward sweeps and one 
backward sweep. 

The various stages of the generalised Thomas algorithm can be interpreted as a 
sequence of operations (forward sweeps) to reduce the matrix А to upper-triangular 
form at which point a back-substitution can be performed to obtain V. Clearly the 
greater the bandwidth of А the less economical will be the above algorithm. 

If the bandwidth itself contains a large number of embedded zeros then 
alternative sparse matrix techniques (Jennings 1977a, Chap. 5) are usually more 
economical, although more difficult to code. 


6.2.5 Block Tridiagonal Systems 


The Thomas algorithm for solving tridiagonal systems of equations is given in 
Sect. 6.22. This algorithm is required, typically, when a single governing equation is 
discretised using an implicit algorithm (Sect. 7.2). However, many flow problems 
are governed by systems of equations (Sect. 10.2 and Chap. 11). Attempts to 
implement implicit algorithms produce, typically, block tridiagonal rather than 
scalar tridiagonal systems of equations. However the Thomas algorithm can be 
extended to handle this situation without difficulty. 
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A block tridiagonal system of equations, equivalent to (6.28) can be written as 


b, с, | V, d, 

a; b, с; г, 4, 
а b, с, 4 = 4 |, 

ан-і бу-у ey-4 Vu-1 dy, 

ау b, Vy dy 


(6.36) 


where a; b, с; are M x M submatrices and И, and d, are M-component subvectors. 
The number M corresponds to the number of equations formed at each grid point, 
e.g. M —5 for three-dimensional viscous compressible flow (Sect. 11.6.3). Conse- 
quently И; is the solution subvector associated with a particular grid point. 
Equation (6.36) represents a system of N blocks of equations, where each block 
associated with a particular grid point contains M equations. 

The solution of (6.36) follows the solution of (6.28) closely. First, the block 
tridiagonal matrix, (6.36), is converted into upper triangular form by eliminating the 
submatrices а,. By analogy with (6.29) the first block of equations gives 


—(b) 'e, and di-(5,)"'d, , (6.37) 


and for the general block 
B-b-ad.. =b) u, di-(À) '(d-adi.i) . (6.38) 


Explicit matrix inverses are shown in (6.37 and 38) In practice it is more 
economical to solve the M component subsystem, e.g. b; с, = с; is solved for cj. After 
execution of (6.37 and 38), (6.36) is of upper triangular form with c; and d; replacing 
с, and d; and the unit matrix / replacing 5,. 

The second stage, equivalent to (6.31) requires a back-substitution 


UA and 
=d;— СИ +: - (6.39) 


The operation count for the block Thomas algorithm is O(5N M?/3) operations, 
which is clearly preferable to O((N.M)?/3) for full Gauss elimination. However, if it 
were possible to split the block tridiagonal system (6.36) into M scalar tridiagonal 
systems the total operation count would be O(5NM) operations, і.е. roughly an 
М?/3 saving for М» M. Consequently there is an incentive to construct implicit 
algorithms that allow some of the equations to decouple, e.g. Sect. 14.2.8. The block 
Thomas algorithm described above follows Isaacson and Keller (1966, pp. 58-61). 
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6.2.6 Direct Poisson Solvers 


The Poisson equation (and the related Laplace’s equation) occurs sufficiently often 
in fluid dynamics to make special procedures for solving the discretised Poisson 
equation worth investigating. A three-point finite difference discretisation of the 
two-dimensional Poisson equation produces a system of equations (6.23) for which 
A has the form shown in Fig. 6.1. 

For a uniform grid (4x= Ay) in a rectangular domain 0<x<1, 0€ y € 1, one 
block of the system of equations can be written 


1+, +10, =, К-і,...,М, (6.40) 


where М is the number of active points in the y direction. Each vector V, contains 
М unknown фуу values associated with one grid line (y= y;) in the x-direction. The 
matrix g is tridiagonal of order N with non-zero entries 1, — 4, 1 associated with the 
points (j — 1, k), (j, k) and ( j+ 1, k), respectively. The vector A, contains all the right- 
hand side contributions to B in (6.23) coming from a single x grid line (у= y,). 

Two techniques, cyclic reduction and Fourier series representation, are particu- 
larly efficient in solving (6.40). The preferred strategy (Swartztrauber 1977) is to 
start with cyclic reduction to reduce the size M of (6.40) and then to solve the 
reduced system using a Fourier series representation. 

In cyclic reduction the block for the kth (even) line of (6.40), multiplied by — g, is 

added to the blocks for the (k— 1)-th and (k+ 1)-th lines to eliminate V,_, and 
V, ,.,. The result is 


Ра g V, IV, — hU А where 
g"-2I—-gg-(/2I*g)(/21—g) апа (6.41) 
AYO =h- gh 5%; . 


This process сап be repeated as many times as required. After | reductions, (6.40) 
becomes ` 


IV, g9V-- IV, s — О, > where (6.42) 
2t 
g= -II (g — БГ) 


and f'=2 cos[(2k — 1)л/2!*1]. 

Іп principle the reduction сап be continued until the boundary conditions 
determine У,_ апа И, + 4: (Dorr 1970). This permits back-substitution to deter- 
mine all the intermediate У, vectors. However, in practice, it is more efficient to 
terminate the process after | reductions and to switch to a Fourier series 
representation. 

If Dirichlet boundary conditions are applied on all boundaries the following 
would be an appropriate Fourier series representation: 
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N .{ sjin 
Ujk = ЛЕН СЕ 5) Гог К= 1, 4... М, К (6.43) 





where М, is the number of x gridlines left in (6.42) after / reductions. The coefficients 
U, , are obtained by solving the tridiagonal systems 








U,,-1 tas +0, к+1=Н,к for s=1,...,N, (6.44) 
where 
H- F h(x ТЕН БСШ (6.45) 
„к= N+1 А э Ук (М +1) ; . 
h(x;, Ук) is an element of A? in (6.42) and 
0.557 
A,2 —2| 142 sin? . 6.46 
; | +: sin (S) | (6.46) 


The fast Fourier transform (Cooley et al. 1970; Brigham 1974) is used to solve (6.43 
and 45). Slightly different forms of these equations and (6.46) occur with different 
choices of the boundary conditions (Swartztrauber 1977). 

For an N=M grid the optimum number of cyclic reductions is given by 
Swartztrauber (1977) as [-1og;(log;N) —1. The combined algorithm FACR(), 
which was first described by Hockney (1970), has an operation count of 
N?log,(log,N). For № = 1000, 1--2, the FACR(2) algorithm is approximately 30 
times faster than the ADI algorithm (Sect. 6.3.2) applied to a Poisson equation. The 
implementation of FACR(I) on supercomputers is discussed by Hockney and 
Jesshope (1981, pp. 346-351) as are strategies for solving discretised Poisson 
equations in three dimensions (Hockney and Jesshope 1981, pp. 351-352). 

For detailed discussions of direct methods for the Poisson equation and the fast 
Fourier transform the reader is referred to Hockney and Jesshope (1981), Swartz- 
trauber (1977) and Dorr (1970) and the references cited therein. 

It should be made clear that the speed of cyclic reduction and the Fourier series 
representation comes from the symmetric, constant coefficient nature of the dis- 
crete Laplacian on a uniform rectangular grid. For an arbitrary grid, discretisation 
of the Poisson equation, e.g. via the finite volume method (Sect. 5.2), an isopar- 
ametric finite element formulation (Sect. 5.5.3) or the use of generalised coordinates 
(Chap. 12), leads to an algebraic equation with variable coefficients. For example, 
with generalised coordinates this could be written 


aj Vj 1k bi Mos eb Ира А кн tE Vim hy, (6.47) 


Or, equivalent to (6.40), 


СЕЙ, - +G Vet А, Қ-А, - (6.48) 
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As before, С, is tridiagonal and C, and D, are diagonal. However, the cancellation 
that led to (6.41) does not occur with (6.48). 

In applying the Fourier series representation to (6.47) a dense system of 
equations for U, , replaces the tridiagonal system (6.44). Consequently the Fourier 
series representation applied to (6.47) is no more economical than Gauss elimin- 
ation (Sect. 6.2.1) applied directly to (6.47). Depending on the grid, (6.47) may also 
include non-zero contributions associated with grid points (7—1, k+1), (7—1, 
k—1), (j+1, k+1) and (j+1, k— 1). 


6.3 Iterative Methods 


Iterative methods can be applied directly to the nonlinear system of equations (6.1); 
however, it is easier to generalise iterative techniques by considering the linear 
form (6.2). All iterative techniques can be viewed as procedures for successively 
modifying an initial guess so that the solution is systematically approached. 
Generally, simple iterative methods will not converge unless A, in (6.2), has large 
elements on the main diagonal (6.52). 


6.3.1 General Structure 


The general structure of stationary iterative techniques is illustrated by rewriting 
(6.2) as 


(N—P)V=B, (6.49) 


where N is close to A in some sense, і.е. |/У|ғ414|, but computationally efficient to 
factorise, e.g. N might be tridiagonal. Equation (6.49) сап be rewritten as 


NV"+D=PV™4B or 

yo*Dz2N-CIPV"-N^ IB or (6.50) 

Vor ро ур , : (6.51) 
where Ё is the vector of equation residuals at the nth step of the iteration, i.e. 

Re?-Ay"—B. 


As the exact solution is approached |R | tends to zero, so that monitoring [Rj 
indicates the progress towards convergence. The general iterative method consists 
of an initial guess, УЧ) and the successive improvement using (6.51). The various 
methods differ, primarily, in the choice for М. 

The scheme (6.51) will converge if the spectral radius, (i.e. the magnitude of the 
maximum eigenvalue) of N^ +P is less than unity. This condition often corresponds 
to the more restrictive condition that 4 is diagonally dominant, i.e. 
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ш>), [Ail - (6.52) 
іші 


The Jacobi method is опе of the simpler iterative schemes for solving (6.2). Іп 
this method 


N=DI and P=(L+U), 


where DI is the diagonal matrix and L and U are the strictly lower and upper 
triangular matrices respectively. Thus (6.50) becomes 


N 
"= (в->. А, ул . (6.53) 


j 
j*i 


The Jacobi method in the above form is uneconomic, requiring too many iterations 
to reach convergence. However, it can be significantly improved by either Cheby- 
shev acceleration (Hageman and Young 1981, Chap. 4) or conjugate gradient 
acceleration (Sect. 6.3.4). 

A more immediate improvement to the Jacobi method is provided by the 
Gauss-Seidel method in which values of vf^* are used on the right-hand side of 
(6.53) as soon as they are available. 

In this case 


N-DI-L, P-U | (6.54) 


and the equivalent of (6.53) is 


i-1 
(в Y Apo У ду? | 4ы 659 
іғі 


]=1+1 


Gauss-Seidel iteration is typically twice as fast as Jacobi iteration but cannot be 
accelerated. If A is diagonally dominant, (6.52), the convergence of Jacobi and 
Gauss-Seidel methods is guaranteed. 

Successive overrelaxation (SOR) provides a significant improvement over 
Gauss-Seidel by evaluating о" * as a weighted average of v and (v^ * P)gs. Thus 
the SOR scheme can be written as 


і-і М 
иә (в- 2 Av * — | Y Aut?) [ay |+а — Ay f? . (6.56) 
j= 


ізізі 


Тһе term 2 is the relaxation parameter. The restriction 0 « 4 «2, is necessary for 
convergence of SOR. In terms of (6.51) the SOR scheme corresponds to 


N- = -1. (6.57) 
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The number of iterations for convergence is sensitive to the choice of 4. Ап 
optimum choice is given by 
2 


where д is the largest eigenvalue of /— DI^ ! A. However, finding и explicitly can be 
as expensive as solving the original problem. Therefore a preferred strategy is to 
obtain an estimate of и as the iteration (6.56) proceeds. Equation (6.58) then 
provides an improved value for 4. Details are provided by Hageman and Young 
(1981, Chap. 9). 

The SOR scheme with a good estimate of /,,, is considerably more efficient 
than either the Jacobi or Gauss-Seidel methods. However, in the original form it is 
not possible to apply acceleration techniques, Chebyshev or conjugate gradient, to 
SOR. But by making a small modification to give the symmetric successive 
overrelaxation (SSOR) method it is possible to introduce acceleration techniques. 
The SSOR method consists of two stages. In the first the SOR scheme is used. In 
the second stage the unknowns are iterated in the reverse order using the SOR 
scheme with the same value of 4 as in the first stage. The SSOR method corre- 
sponds to the following choice for N in (6.51): 


A (6.58) 


(DI+ AL) DI” "(DEC U) 
AQ- 4) 


It should be emphasised that the SSOR method is less efficient than the SOR 
method, unless acceleration techniques are included. 


N- (6.59) 


6.3.2 Duct Flow by Iterative Methods 


In this section we apply the three iterative methods, Jacobi, Gauss-Seidel and 
successive over-relaxation (SOR), to the problem of fully developed laminar flow in 
a square duct. These three iterative methods are discussed in Sect. 6.3.1, as explicit 
or point algorithms. Using the Thomas algorithm (subroutines BANFAC and 
BANSOL, Sect. 6.2.3), these iterative methods can also be applied as implicit or line 
algorithms; line SOR and ADI will also be described in this section. 

The current example, the duct-flow problem, is also used to illustrate the finite 
element method in Sect, 5.5.2 and a computer program, DUCT (Fig. 5.22), is 
provided. Viscous flow in a duct is governed by the nondimensional equation (5.97) 


Ь\ 202% 02% 
(2) д + By? +170 , (6.60) 


with boundary conditions # =0 at х= +1, у= +1. The parameter b/a is the duct 
aspect ratio, Кір, 5.21. A three-point finite difference discretisation of (6.60) is 


2 
(5) Wj-1,4—2Wj et Wiik NALE —2Wwj , Wie 


Ti T. 41-0. (6.61) 
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Applying the Jacobi scheme (Sect. 6.3.1) to (6.61) produces the algorithm 


bM win) (n) wo (m) 
19-05] 1+; ) ы w | [PARI . (6.62) 


where (п) is the iteration index and РАВ! =(b/a)}?/Ax?+1/4y?. In the Jacobi 
scheme all grid-point values on the right-hand side of (6.62) are evaluated at the nth 
iteration level. 

In the Gauss-Seidel scheme grid-point values are evaluated at the latest 
iteration level available, Thus if the iteration sweeps repeatedly in the y direction 
(increasing k) for successive x values (increasing j) the Gauss-Seidel equivalent of 
(6.62) would be 


b WEE (n) (+1) (п) 
129-05] 1-2 ) "сыраны EE es |" РАВ (6.63) 


Іп the SOR scheme the solution of (6.63), now called үу %) is combined with the 
previous solution, м), as 


wir) = Aw} +(1—A)wi = wh A(wT — и), (6.64) 


where 4 is the relaxation factor. The Gauss-Seidel scheme is obtained when A=1. 
The SOR scheme applied to (6.61) in the range 0<A<2 will produce a converged 
solution. 

The three iterative methods are also applied with the finite element discretis- 
ation of (6.60), which is described in Sect. 5.5.2. For example, the equivalent of (6. 63) 
is given by (5.108). 

Table 6.3 provides a comparison of the number of iterations to reach conver- 
gence for the duct-flow problem, solved by the program DUCT, on an 11 x 11 grid. 


Table 6.3. Number of iterations to convergence (b/a — 1.0) 


Point Point Point ADI- ADI- 
FDM FDM FEM FDM FEM 
À (TOL=1 х 10-3) (TOL =1 x 107%) (TOL = 1 x 1076) (TOL =1 x 1075) (TOL =1 x 1075) 4, 
Jacobi 4l 87 64 19 19 0.9 
0.8 39 74 55 16 18 1.1 
0.9 33 62 45 14 17 1.3 
1,0 (GS) 28 51 38 12 16 1.7 
1 24 43 32 12 15 24 
1.2 21 36 26 12 14 2.5 
1.3 18 29 21 12 13 2.8 
14 15 23 17 11 14 3.3 
1,5 12 18 12 11 15 3.7 
1.55 11 15 13 11 16 4.1 
1.6 11 17 15 12 17 4.5 
1.7 14 21 20 
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Convergence is assumed when || К°], < TOL. Results for two tolerances are 
presented. The starting solution for the iteration is the exact solution of (6.60). For 
a finite grid this solution is not the same as the solution of the discretised equations 
(6.61). Starting from the solution м9 =0 requires more iterations, as can be 
appreciated by comparing the results shown in Tables 6.3 and 6.4. 

The results shown in Table 6.3 indicate that the SOR scheme, with the optimum 
value of 4, converges more quickly than the Gauss-Seidel scheme, which converges 
more quickly than the Jacobi scheme. For the finite difference method the reduc- 
tion of the rms value of the change in the solution, wf";  — и, is plotted against 
iteration number n in Fig. 6.20. 


N 
„JACOBI 


N 
V 


20 40 60 80 100 
lteration number, n 





Fig. 6.20. Convergence history for point FDM (TOL = 1 х 1075) 


The results shown in Table 6.3 indicate that the point finite element method 
(5.108, 109) converges more rapidly than the point finite difference method for 
values of A < 1.7. However the computational work per iteration is greater for the 
finite element method, so that it is not necessarily more efficient than the finite 
difference method for this problem. 

All the algorithms given above, that is (6.62—64) have been explicit. It is possible, 
and desirable, to consider groups of nodal unknowns implicitly, as long as the 
resultant system of equations can be solved efficiently. This can be done by forming 
tridiagonal systems of equations associated with each line (constant k) in the x 
direction. Thus (6.61) is then written as an algorithm as 
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b 2 2(b 2 Ь 2 
(BRE ue (P) en. 


wa —2w wl 





(6.65) 


This system can be solved conveniently using BANFAC/BANSOL (Sect. 6.2.3). 
Since the coefficients multiplying w‘*) are constant оп the left-hand side of (6.65), 
BANFAC is only required for the grid-line k = 2, The same factorisation is used by 
BANSOL for all grid-lines. The combination of (6.65) and (6.64) is known as 
successive line overrelaxation (SLOR). 

The ADI formulation (Varga 1962, Chap. 7) is similar to SLOR but with the 
implicit direction alternating at each iteration. Thus in place of (6.65) the following 
two-stage algorithm is used. For the first stage the following tridiagonal system 
associated with each line (constant К) in the x direction is solved: 


—A,w(9, ,-F (1--24,) wf 4-4 «к, =À Ау? + Аум - 1 
+(1—24„)» +A "ei . (6.66) 


For the second stage the following tridiagonal system associated with each line 
(constant j) in the y direction is solved: 


АЕ (1-2, 0 м = Ду +A.w , 
+(1—24)» + Ам, о, (6.67) 


where 4, is chosen to accelerate convergence and 4, = (bA y/aA х)24,. Strategies for 
choosing 4, are discussed by Varga (1962) and Wachpress (1966). 

The above ADI scheme is equivalent to the ADI scheme described in Sect. 8.2.1. 
Comparable formulae to (6.66, 67) can be obtained for the finite element method by 
adapting (8.35, 36). The number of iterations for the ADI finite difference method 
and ADI finite element method are shown in Table 6.3. Both methods give better 
convergence than the corresponding point algorithms for 4 not at the optimum 
value. 

The convergence rate is a strong function of grid refinement. As the grid is 
refined the number of iterations to convergence increases and the optimum 4 
becomes larger. This situation is illustrated in Table 6.4, which is based on a 
starting solution и) =0. 

The iterative algorithms considered above can be written as 


wt Do Gy | 


The reduction in the rate of convergence as the grid is refined corresponds to the 
maximum eigenvalue of G approaching unity. Some means of accelerating the 
convergence rate is clearly desirable. Hageman and Young (1981) discuss conjugate 
gradient and Chebyshev acceleration techniques. However, applied to Jacobi or 
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Table 6.4. Effect of grid refinement on convergence: Duct prob- 
lem by point SOR (FDM) 


Number of iterations 
to convergence 


Grid Optimum 4 (TOL=1 x 10-8) 
6x6 1.30 12 

11х11 1,55 23 

21х21 1.74 41 

41х41 1.86 79 


Gauss-Seidel schemes neither Chebyshev nor conjugate gradient acceleration 
produce a faster rate of convergence than SOR with an optimum value of the 
relaxation factor 4. As a very robust algorithm, Hageman and Young recommend 
SOR with an adaptive algorithm for choosing 4. A subroutine for this purpose 
is provided by Hageman and Young (1981, pp. 368-372). Jennings (1977a, 
pp. 210-212) discusses the use of Aitken acceleration. 

It should be pointed out that for iterative techniques, such as those described 
above, to be effective the problem being solved should be strongly elliptic, e.g, 
second-order derivatives are present but not first-order derivatives. Often this leads 
to the matrix of equations, e.g. those formed from (6.61), being symmetric and 
positive definite, The matrix A is symmetric and positive definite if 47 =A and 
x? Ах>0 for all x 40. 

Examples of equations that are effectively solved by the above iterative tech- 
niques are the Poisson equation (for the stream function or pressure) that arises in 
viscous flow problems based on the stream function vorticity formulation 
(Sect. 17.3) and the potential equation that often arises in inviscid flow problems 
(Chap. 14), particularly transonic flow (Sect. 14.3). 

If first derivatives of significant magnitude are present the above techniques are 
very ineffective and may fail to converge, For example, applying the point SOR 
(FDM) technique to the two-dimensional Burgers’ equations considered in 
бесі, 6.1.3, on a 21 x 11 grid, requires 4 < 1.0 for convergence. After 270 iterations 
the equation residuals are reduced to about 1.4х 107°, For comparison, an 
approximate factorisation of the unsteady Newton's method (Sect. 6.4.1) achieves 
the same degree of convergence in 15 iterations. 

Significant first derivatives occur in the momentum equations for the primitive 
variables (Sect. 17.1), the transport equation for the vorticity (Sect. 17.3) and the 
energy equation (Sect. 11.2.4). 


6.3.3 Strongly Implicit Procedure 


This strongly implicit procedure (SIP) scheme starts from the general stationary 
iterative technique (6.49) and factorises N into L U for the situation where A arises 
from three-point centred difference discretisations in two dimensions. In this case 4 
has the structure shown in Fig. 6.1. 
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Stone (1968) showed that this could be done in a way that produces three 
diagonals in І, and three diagonals in U of which the main diagonal elements are 
unity. Forming N= LU indicates that Р consists of two diagonals multiplying 
p 4 1,j- 1 and vj. | ;+,- Рога linear system of equations (6.2) the elements of L, U and 
P can be evaluated once and for all. 

The algorithm is implemented in two stages. First a forward sweep solves 


Lw**D=B+PV™ , (6.68) 
This is followed by a back-substitution 
Uye*DazyeD, (6.69) 


The algorithm is robust and is often more efficient than the ADI scheme (6.66, 67). 

Schneider and Zedan (1981) have developed a modified strongly implicit (MSI) 
algorithm that is applicable to either five-point or nine-point two-dimensional 
discretisations, i.e. it can be applied to three-point finite difference discretisations or 
linear finite element discretisations. Here the five-point version of MSI will be 
described; the reader is referred to Schneider and Zedan for the nine-point version. 

The MSI algorithm is executed using (6.68 and 69). However L, U and P differ 
from the form used by Stone. In the MSI algorithm L has four non-zero entries per 
row, U has three non-zero off-diagonal entries plus unity on the diagonal and P has 
two entries. If V is numbered in the increasing j and then increasing k directions, 
(6.68) becomes 


Лама —b; t Pla (02 а.-і-2(-20;,--20; 1.05. 1)1% 
кр? [0-2 к+1 —a(— 20; 20-250. DJ” 
-Геа а-а а-а-а 1” D, (6.70) 
The back-substitution (6.69) is implemented as 
vit D= wer ) — (gj xvj ЕЙ; ко; 1,k+1 tias)" М, (6.71) 


In (6.70 and 71) coefficients с, d, e and f are part of L, coefficients д, h and t are 
the part of U and coefficients р! and р? are part of P. The coefficients are related to 
the elements of A by 


ак а 


C=] , Т Ср рка 5 ёра р Cj T1) 
—€gi.k-18j- 1,k-1 


1 — 2 — 
Рк=@ркй+1,к-1 А Pjka—6jkhj- ik , 


1 2 
fix а-дайца-і- Шаһ акт jkBj- 1k + 20 Dit Pj, к) , (6.72) 
—e.:,t: 
1 Qk'j-l.k 
Фа“ 0316 „кй+а,к-1—2@р}у 5 h,,=—_ 5 
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The parameter « is used to accelerate convergence; «=0.5 is close to optimal. 
Schneider and Zedan (1981) indicate that MSI is typically two to four times more 
economical than SIP and does not require grid renumbering after every iteration as 
does SIP. 

Application of strongly implicit procedures, related to the above formulations, 
are reported by Rubin and Khosla (1981), Zedan and Schneider (1985) and Lin 
(1985). 


6.3.4 Acceleration Techniques 


The various iterative techniques described in Sects. 6.3.1-6.3.3 are effective but for 
some problems demonstrate rather slow convergence after the residual R 
has undergone an initial rapid reduction. If the error in the iterative solution, 
е= И) V*, is expanded as a Fourier series, the initial rapid reduction in the 
residual corresponds to the reduction in the amplitude of the short wavelength 
modes in the Fourier series, The subsequent reduction in the amplitude of the long 
wavelength modes often proceeds at a much slower rate. 

However, it is possible to modify the iterative techniques already considered to 
accelerate convergence, Two of the more effective techniques are Chebyshev 
acceleration and conjugate gradient acceleration (Hageman and Young 1981). 
Here we will describe the key features of conjugate gradient acceleration. 

Initially it will be assumed that the matrix A in (6.2) is symmetric positive 
definite. That is, х7 Ах>0 for any choice of x. The discretisation of Laplace’s 
equation with centred finite difference expressions produces a linear system of 
equations for which A is symmetric positive definite. 

The steps in the traditional conjugate gradient method (Hestenes and Stiefel 
1952) can be described as follows: 


i) perp pms 40 pm , 
ii) В" * 0 2 go» — 20 po , 
iii) о" 1) — (Ret Ur Rt 1) , 


(п+1) 
iv) ae* DP 


p? 
+1 +1 +1 
v) PPDS ROD, g@tD por 


vi) Urt- A рез» 


(6.73) 


> 


о4%% 


vii) 4% "= рез Dr уату: 


The algorithm is relatively economical since only one matrix-vector multiplication, 
step (vi), is required. It may be noted that the residual, К"*1, is evaluated 
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recursively, from step (ii), instead of from R"*U 2 Bg— 4 V'^*', The parameter 
o^ * is monitored to indicate proximity to convergence. In contrast to the SOR 
scheme (Sect. 6.3.1) there are no empirically chosen parameters in the conjugate 
gradient method. 

In (6.73) P'? constitutes a search direction which at the beginning of the 
iteration is given by P(9 = ©), The choice for a'"^* P in (6.73) ensures that the 
search directions Р 9 are mutually A-conjugate (orthogonal) in the sense that 


РФТАРӘ-0 for izj. (6.74) 


In addition the overall algorithm ensures that R?T RO 0 for 15]. These feature 
produce a robust algorithm that, in the absence of round-off errors, would obtain 
the solution of a system of N linear algebraic equations (6.2) in no more than N 
iterations, The choice of А in (6.73) ensures that the equivalent quadratic function 


F(V)-05VTAV—V7T B (6.75) 


has a minimum in the Р“) direction at / 47), 

If the residual R is expanded as a series based on the eigenfunctions of A it is 
found that each step of the conjugate gradient method has the effect of approxi- 
mately eliminating the contribution from each eigenvector in turn (Jennings 1977b). 
Consequently if some of the eigenvalues of 4 are bunched, convergence is achieved 
in less than N iterations. 

In addition if it is possible to premultiply (6.2) so that the eigenvalues of the 
resulting matrix are more bunched than for А and the spread of the eigenvalues 
(Amin» ^max) is reduced, the subsequent application of the conjugate gradient method 
to the “preconditioned” system converges faster. The best choice for the precondi- 
tioner would be А7! since then the solution follows trivially. А good choice is N^ !, 
which appears in the general form of a stationary iterative technique (6.51). Thus 
(6.2) is replaced by 


N^IAV-N^OB. (6.76) 


To use the above conjugate gradient method it is necessary to make N^ !A 
symmetric. This can be done by forming ИМ! АИ’! where W is chosen, 
typically, so that WTW = №. Consequently (6.76) is replaced by 


B , where (6.77) 
-WN-AW^, V=WV and B-WN^!B. 


The application of the conjugate gradient method to (6.77) is called a pre- 
conditioned conjugate method and, with a good choice for N, will produce more 
rapid convergence than the direct application to (6.2). 
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In practice (6.77) is not formed explicitly. Instead (6.73) is replaced by 

i) үе" po, 2 ро) , 

ii) ont 1) — 50). 10 Uo ; Ret 1) — W +0 WN' !g^*» ; 
iii) og" * » (RCD gern , 
іу) a" * Da o"t gt , (6.78) 
у) Рети ето ү qot 0p ; Peto WPD, 

vi) U^*0-N-1A4p0o*D ; + йет), 
о” 1) 


(Pet D)r Газ 1) ^ 


In the above sequence, ó and В play the roles of pseudoresiduals and the value of 
о”? indicates the closeness to convergence. 

Different choices of N, and hence W, correspond to different iteration schemes 
in Sect, 6.3.1. Thus (6.78) can be interpreted as a conjugate gradient acceleration of 
the basic scheme, Àn alternative, but very effective, preconditioner can be construc- 
ted by approximately factorising А into lower triangular, L, and upper triangular, 
U, matrices (Jackson and Robinson 1985). 

For a finite difference discretisation of the steady transport equation, (9.81) 
without the time-dependent term, the resulting system of equations, equivalent to 
(6.2), will not be symmetric due to the influence of the discretised first derivative 
terms, и2Т/дх and одТ/ду. The conjugate gradient method can still be used with 
modifications (Axelsson and Gustafsson 1979). | 

One such scheme, ORTHOMIN (due to Vinsome 1976), is described here. In 
the presence notation (6.78) is replaced by 


үй) 26802 


ОНО pm | 


po-gma y a) pO | — where (6.79) 
5% = №! к 
"E (ZAJ po 
 (ZAP9r po ' 
Z-(N 4y . 


If all terms are retained in the Ў in the expression for P the method requires 
i 


considerable storage and is rather uneconomic, In practice good performance is 
obtained with і-0,...,4. 

Markham (1984) has compared various preconditioned conjugate gradient- 
type methods for asymmetric systems of equations. ORTHOMIN is effective but 
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Markham finds that the biconjugate gradient method of Fletcher (1976) and 
variants of it are more efficient. However Jackson and Robinson (1985) find that 
minimum residual methods are more effective than the biconjugate gradient 
method if the test problem is very asymmetric. Thus there is no universally 
preferred conjugate gradient method if A is of general form. 

It is possible to symmetrise a general matrix equation (6.2) by replacing it with 


ATAV=A'B. (6.80) 


This approach is not generally recommended because if A is at all ill-conditioned 
АТА will be even more so, with a consequent loss of accuracy when (6.80) is solved 
for V. However Khosla and Rubin (1981) have solved the equations governing 
incompressible viscous flow past a circular cylinder at Re=100 by symmetrising 
the governing equations as in (6.80) and applying a SIP (Sect. 6.3.3) preconditioned 
conjugate gradient method. The SIP method alone requires approximately 240 
iterations for convergence оп a 61 х 24 grid, whereas the SIP preconditioned 
conjugate gradient method requires only about 40 iterations for convergence. 


6.3.5 Multigrid Methods 


These methods are applicable to both linear systems (6.2) and nonlinear systems 
(6.1). A typical method will be described initially for linear systems, the extension to 
nonlinear systems being described subsequently. 

Multigrid methods work with a sequence of grids m=1,... , М, where the grid 


size ratio 4,,1/4,--0.5. The linear system to be solved on the finest grid is written 


AM VM — BM | (6.81) 





An approximation to V is provided by the solution on the next coarser grid, 
y^. Similarly for an intermediate grid the solution, V", is a good approximation 
to the solution on the next finer grid, V^* !. 

If an approximation to the solution of 4"*! V™*+!=B™*! is denoted by 
y"*1^ so that 


рту ута mti ед (6.82) 
А" рут Веі  4т+1 yrti.a— рт | (6.83) 


The correction, W"*!, and the residual (or defect), R"* !, are closely approximated 
by the correction and residual on the next coarser grid, И” and R", if they are 
smooth enough, i.e. if the amplitudes of high frequency components are small. The 
highest frequency that can be represented on a discrete grid, A m, is 24 „ (Sect. 3.4.1). 

Relaxation (iterative) procedures, such as Jacobi, Gauss-Seidel and SOR de- 
scribed in Sect, 6.3.1, remove high frequency components іп a few iterations. It is 
the removal of the low frequency components of the error, and equivalently of the 
residual, that causes the slow convergence of relaxation methods on a fixed grid. 
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However, a low frequency component on a fine grid becomes a high frequency 
component on a coarse grid, Consequently multigrid methods seek to exploit the 
high frequency smoothing of relaxation methods in the following way. 

For a given (т + 1)-th grid a few (v!) relaxation steps are made to smooth the 
high frequency components in the correction and in the residual. From (6.83) this 
can be written symbolically as 


m+1,¥1__ м (Ит gqmtl ртт 
Ww = RELAX" (W"*!, 4"*, R"*!) апа (6.84) 


"урт!  4дт+1 нут.» 


For a linear system of equations the relaxation can be applied to either the original 
equation (6.2) or to the residual/correction equation (6.83), For nonlinear systems 
of equations the relaxation must be applied to the original equation (6.1). 

The equivalent residual distribution on the mth grid, В", is computed from 
К" +1." by a restriction operator 17, |, This can be written symbolically as 


К"=1",, R", (6.85) 


Appropriate forms of the restriction operator will be indicated later. The process of 
relaxation and restriction is continued until the coarsest grid, m=1 is reached. 

On the coarsest grid it is economical to continue the relaxation (or any other 
iterative method) until the converged correction, W', is obtained. Alternatively 
A! W!=R' or (6.2) can be solved by a direct method (Sect. 6.2). It may be noted 
that W! is non-zero if Ё! is non-zero, in general. 

The correction W?, can be obtained from W! by prolongation (interpolation). 
This is written symbolically as W?—I2W! or, more generally, 


трт рт. (6.86) 


Appropriate forms of the prolongation operator 1771 will be indicated later. On 
the finer grid, m+1, a few (у?) relaxation steps are made, ie. 


wnt LY. RELAX" (W"*!, amt, К") , . (6.87) 


before prolongating to the next finer grid. 

A single multigrid cycle starting from the finest 219, т-- М, consists of the 
repeated application of (6.84 and 85) until the coarsest grid is reached, the exact or 
complete iterative solution of 4! И! = R! and the repeated application of (6.86 and 
87) until the finest grid, m — M, is again reached. This is called a V-cycle and a flow 
chart is shown in Fig. 6.21. 

The cycle is repeated until satisfactory convergence on the finest grid is 
achieved. Ten cycles to reduce the change in the solution to 1075 for successive 
cycles might be typical if (6.2) were a Poisson equation on a uniform grid. For the 
first cycle И” is set equal to the initial guess of the fine-grid solution, И", 

The restriction operator, Г». , in (6.85), permits the residual distribution on the 
coarser grid R" to be constructed from the residual distribution on the finer grid 
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Fig. 6.21. Flow chart for the multigrid V-cycle 


К" +! On a physical grid, with j and k denoting grid point locations in x and y on 
the coarser grid, the simplest choice is given by direct injection, К", = R7, '. This is 
stable and computationally efficient for strongly elliptic problems, e.g. Laplace's 
equation in Sect. 5.2.3. However for weakly elliptic nonlinear problems, i.e. high Re 
viscous flow (Chap. 17), a more stable choice is given by the five- and nine-point 
operators, 


121 
242 (6.88) 
121 | 


The prolongation (interpolation) operator іп (6.86) is usually based on bilinear 
interpolation in two dimensions. Thus if the j, К grid point labels are associated 
with the coarser, m, grid then 
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wW =W], (6.89a) 
Wie gk m O (wp WHE 14) › (6.89b) 
Weegee, = 0.25 (Wit Weert Wher T ИЙ cet) . (6.89c) 


The w"*! at points ( j, К+ 1) are handled as in (6.89b). It should be noted that 
grid points (j, К) on the coarser grid will be relabelled (27+ 1, 2k -- 1) on the finer 
grid. 

For nonlinear problems it is possible to combine a multigrid method with 
Newton's method (Sect. 6.1.1). A linear system is solved for the correction to the 
solution at every step of the Newton iteration (6.6). The multigrid algorithm in 
Fig. 6.21 can be applied iteratively to solve (6.6). 

However, a multigrid method can be applied directly to the nonlinear system 
(6.1) without ut thé need for introducing Newton's method. ' The essential difference 
from the algorithm « described in Fig. 6.21 is that at the restriction stage, (6.85), both. 
К" +17 and the approximate solution, yr la кейіне Бе! restricted. onto the co: coarser 
grid m. At the mth level a solution of 


А" рта gp" A"IT +, y"*t а (б.90) 








is sought in place of (6.83). 

In practice the solution И" is obtained by relaxation and further restriction to 
the coarsest grid, exact solution on the coarsest grid and then relaxation and 
prolongation to successively finer grids, in a manner equivalent to that shown in 
Fig. 6.21. Once the solution V"? is available, the correction 


"= рта рт рт | (6.91) 


is computed and prolongated to the (m+1)-th grid. There a revised solution, 
y"*i2.-["*! ym is relaxed to give V"*+!-” essentially as in the linear case. 

The nonlinear scheme is referred to as the full approximation storage (FAS) 
method by Brandt (1977) to distinguish it from the correction storage (CS) method 
given in Fig. 6.21. Since the FAS method passes both the solution and the residual 
to coarser grids it is slightly more expensive (about 5%-10%) than the CS scheme; 
generally it should not be used for solving linear problems. Like the CS method, the 
FAS method relies on the smoothing of the solution correction W and the residual 
R for part of its effectiveness. 

The description of both the CS and the FAS methods has started with an 
approximation to the solution on the finest grid, т = M, and has followed a V-cycle 
to the coarsest grid and back again to improve that solution. This is an appropriate 
strategy if a good approximation to the solution, V™, is available. 

If nothing is known about the solution it is better to start with the solution on 
the coarsest grid, obtained either directly or by (conventional) iteration. A high- 
order interpolation is made to the next finer grid and r multigrid V-cycles are 
applied to improve the solution. The high-order interpolation plus r multigrid V- 
cycle process is repeated on successively finer grids until т= M is reached, and the 
multigrid V-cycle is repeated until convergence. 


6.3 Iterative Methods 207 


This is the basis of the full multigrid (FMG) method, which is a generalisation of 
both the FAS and CS methods. The FMG method is shown schematically in 
Fig. 6.22 for a linear system of equations. The function INT(V™, m) represents a 
cubic interpolation of V" first іп the x direction and then, for all х"* P? grid points, 
in the y direction. The function MGI(V"*'*!, А”, B") denotes one pass through the 
multigrid V-cycle shown in Fig. 6.21. 
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Solution of А“ ув” 


The multigrid strategy can be viewed as ап acceleration technique like the 
conjugate gradient method (Sect.6.3.4) applied to a basic iterative scheme 
(Sects. 6.3.1—6.3.3). Alternatively the basic iterative scheme is interpreted as а 
smoother for the multigrid method just as it was treated as a preconditioner for the 
conjugate gradient method. As with conjugate gradient acceleration, the use of an 
incomplete (approximate) line L.U factorisation of A for the smoothing process 
(6.84, 87) is found to be very effective (Sonneveld et al. 1985). This is particularly the 
case when A is very asymmetric, i.e. for convection-dominated problems (Chaps. 9 
and 10). 

Multigrid methods are described in detail by Brandt (1977), Stuben and 
Trottenberg (1982) and Hackbusch (1985), amongst others. Brandt, Stuben and 
Trottenberg and Briggs (1987a) provide FORTRAN programs for a simple 
multigrid method applied to a Poisson equation. The description given here follows 
Stuben and Trottenberg primarily; but Briggs (1987b) probably provides an easier 
orientation. Applications of multigrid methods to inviscid flow are indicated in 
Sects. 14.2.9 and 14.3.5 and to viscous incompressible flow in Sects. 17.2.3 and 
17.3.1. 


Fig. 6.22. Flow chart for the FMG method 
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6.4 Pseudotransient Method 


An alternative to the solution of the algebraic equations produced by discretising 
the steady problem is to construct an equivalent unsteady problem and to march 
the transient solution until the steady state is reached. Time then plays the role of 
an iteration parameter. 

Thus if Laplace’s equation 


925 025 


—TL- 6.92 
axi ду? у (6.92) 


is discretised using centred finite difference formulae with Ах= Лу, the Jacobi 
method gives, for internal nodes (on a j, k grid as in Fig. 6.1), 


(pk D= 0.25(vf +1+ vt it of) eto sy) . (б.93) 


То generate an equivalent unsteady method, (6.92) would Бе replaced Бу 


00 (24-24). (6.94) 


— = а — — 
д дх? ду? 

Discretisation using finite difference formulae and Ах= Лу gives the algorithm 
ру m (10—45)05 ,-F S(05 0а 005) 5 (6.95) 


where s —«4t/Ax?. The choice s — 0.25 reproduces (6.93). Thus there is a clear link 
between the unsteady formulation and the iterative techniques considered in 
Sect. 6.3. 

À major advantage of the unsteady formulation is that it makes available the 
ADI method would generate equations (8.14 and 15). The solution of these 
equations is obtained by solving the subsystems of equations associated with each 
grid line using the Thomas algorithm (Sect. 6.2.2). This is illustrated in Fig. 6.23. 

Since the transient solution is not of interest it is customary to choose a 
sequence of time-steps to minimise the number of time-steps to convergence. The 
sequence is problem dependent but is typically geometric with the range of time- 
steps corresponding to the range of eigenvalues associated with the x-gridline and 
y-gridline tridiagonal matrices (Wachpress 1966, Chap. 6). 

For flow problems it is often found that the approach to the steady state is 
considerably more rapid in some parts of the computational domain than in others. 
For the flow over a backward-facing step (Sect. 17.3.3) the solution in the separated 
flow region behind the step is much slower to converge than in the regions far from 
the step. The spatial dependence of the convergence rate suggests the following 
modification to (6.94): 

ду 925 025 
ааа аа). (6.96) 
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Fig. 6.23. Pseudotransient method with split- 
ting for a two-dimensional problem 
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where c(x, y) is adjusted empirically to balance the convergence rate in the different 
parts of the computational domain. The splitting scheme is then applied in the 
conventional manner (Sect. 8.2). 


6.4.1 Two-Dimensional, Steady Burgers" Equations 


Here we illustrate the effectiveness of the pseudotransient method by combining it 
with Newton's method applied to the two-dimensional steady Burgers' equations 
(Sect. 6.1.3). The unsteady equivalents of (6.12) are 


Gu ou 500 1 ou Ou _0 
ðt 0х ду Ве\дх? ay? J | (6.97) 
0, 08 1 (2909) о 
ôt dx ду Ве\дх? dy?/ 
The discretised form of (6.97), equivalent to (6.13), is 
изу — uk ntl. 
a CR 50, (6.98) 
071—034 TRU =0 , 


At 


where Кизү! and Коту! are given by (6.13). 

It may be noted that the steady-state terms have been evaluated at time-level 
n+ 1. This is to make the pseudotransient form compatible with Newton’s method 
and to allow the time level n+ 1 to be directly equivalenced to the iteration level in 
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Newton’s method (Sect. 6.1.1). An examination of (6.13) indicates that Ru; and 
Rv, are functions of uf {', ить, vj j, à, etc. Therefore expanding Ки?! and 
Rv i! about time (iteration) level n, as in Sect. 8.22, allows (6.98) to be written as 








дить пат u" un n 
Деу! Rv; ү 
" * | 44, „=0 
At + j.k + да, m 41, т , 


where Au" +1 u^ * 1 — и", etc. The term д includes both и and v and the indices J, m 
denote all possible values of j and К that lead to non-zero partial derivatives. That 
is !=j—1, j, j-- 1; m- k — 1, k, k - 1. Equations (6.99) сап be combined into a single 
equation as 


1 
(-ш-- К, (6.100) 


which may be compared with the conventional Newton’s method (6.17). The effect 
of choosing small values of At is to make the augmented Jacobian (7/4t+J) more 
diagonally dominant. 

Equation (6.100) is a more effective means of "under-relaxing" Newton's 
method than (6.19) For the solution of Burgers’ equation a typical output 
produced by the program NEWTBU based on (6.100) is shown in Fig. 6.24. It is 


NEWTONS METHOD FOR N = 18 ТТИХ= 50 IRD= 1 IPN= 5 
DT= .10000E-01 EPS= .100Е-04 RE= .100Е+02 ОН= .100Е%01 
А = 110.13 110.13 .00 .00 1.00 

UEx .9990 .9586 4888 -.0559 -.0991 
UE= .9990 .9583 .4863 -.0565 -.0991 
UEx .9990 .9572 .4786 -.0584 -.0991 
UEx .9989 .9553 .4655 -.0614 -.0992 
UEx ‚9988 ‚9524 .4462 -.0657 -.0992 
VE= .0000 .0000 .0000 .0000 .0000 
VE= 21317 01277 .0753 .0090 ‚0012 
VE= .2679 .2598 .1515 .0179 .0023 
VER .4142 .4010 .2297 .0266 .0034 
VE= .5774 .5577 .3109 .0349 . 0045 


IT» 0 RMS= .1496D+00 R= -.21690+00 -.2850D-01 -.21880+00 
И= 5 RMS* .1756D-01 Вә -.10710-01 -.23090-02 -.1703D-01 
IT» 10 RMS= .2063D-02 R= -.1072D-02 -.2097D-03 -.1748D-02 
IT» 15 RMS= .2465D-03 R= -.1160D-03 -.2103D-04 -.1905D-03 
IT- 20 RMS- .29630-04 R= -.1308D-04 -.2219D-05 -.2157D-04 


AFTER 23 ITERATIONS THE RMS RESIDUAL IS .831740-05 


Ux „9990 .9586 4888 -.0559 -.0991 
Us .9990 .9605 4835 -.0567 -.0991 

Us 19990 29603 478 -.0588 -.0991 

Us 29989 29577 14625: -.0615 -.0992 

Us 29988 29524 442 -.0657 -10992 

Үш (0000 .0000 .0000 "0000 "0000 

y= 21217 11282 107148 -0090 .0012 | 

y= 22679 2605 11507 10179 .0023 |. | 

Y- .4142 .4017 .2290 .0266 0034 Fig 6.24. Typical output from pro- 


Үш .5774 .5577 .3109 .0349 .0045 gram NEWTBU, incorporating (6.100) 
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apparent that convergence is much more rapid with (6.100) than with (6.17, 19). The 
convergence of (6.100) is dependent on the choice of At. A large value of At implies 
convergence like the full Newton’s method. But for the two-dimensional Burgers’ 
equations this implies a divergence (Sect. 6.1.3). A small value of At implies a slower 
rate of convergence but has the advantage of greatly expanding the radius of 
convergence. This overcomes a major disadvantage of Newton's method in Table 
6.5. Aslong as the equations being solved, (6.98), have a unique solution, choosing a 
sufficiently small At will find the solution from any reasonable starting solution. 

The modifications to NEWTBU to implement (6.100) are given by lines 36 and 
39 of subroutine JACBU (Fig. 6.13). As implemented in NEWTBU the major 
disadvantages of Newton's method are associated with the lack of economy of 
forming J and factorising the augmented Jacobian (7/At-* J}. The factorisation of 
Jis an O(N?) process which is prohibitively expensive if N is large. In addition, the 
retention of all locations of J in main memory for reasonable economy places a 
severe limitation on the largest system of equations that can be solved. 

Many entries in J are zero, and of those that are non-zero, only a few are large. 
This implies, in the present problem, that Ru and Rv are strong functions of some 
nodal unknowns u;,, v;, but very weak functions of other nodal unknowns. 
Consequently it is desirable to consider approximations to J that capture the 
strong dependence, but ignore the weak dependence if this leads to smaller 
Jacobians or structures of J that lend themselves to approximate factorisation 
(Sect. 8.2) in an efficient manner. The price to be paid may be more iterations to 
reach convergence. But if each iteration is sufficiently economical the price is 
acceptable. 

When the approximate factorisation (splitting) techniques described in Sect. 8.2 
are applied to nonlinear problems they can be related to an approximate factoris- 
ation of (6.100). This aspect is clarified in Sect. 10.4, where the two-dimensional 
Burgers’ equations (the current problem) are solved by an approximate factoris- 
ation of (6.100). 


6.5 Strategies for Steady Problems 


Most flow problems are governed by systems of coupled partial differential 
equations (Chap.11). For flow formulations involving velocity potential 
(Sect. 14.3), stream function (Sect. 17.3) or incompressible flows involving pressure 
(Sects. 17.1 and 17.2) опе of the governing equations is typically diagonally 
dominant. In some instances this equation may also be linear. Consequently the 
iterative techniques (Sect. 6.3) are then applicable, particularly multigrid methods 
(Sect. 6.3.5), as are the special direct solvers (Sect. 6.2.6) if the grid is uniform. 
When velocities or vorticity appear as dependent variables the governing 
equations usually include nonlinear convective terms (Chap. 10). Discretisation of 
such equations produce nonlinear systems of equations like (6.1) that are not 
diagonally dominant, unless the (viscous) diffusion terms are much larger than the 
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convective terms. The choice of solution method is then often between a Newton or 
quasi-Newton solver and a pseudotransient, approximate factorisation solver. The 
relative strengths and weaknesses of the two approaches are summarised in Tables 
6.5 and 6.6. 


Table 6.5. Newton’s method 


Advantages: (i) Rapid convergence (few iterations) 
(ii) Can be modified to overcome many explicit disadvantages 
(іі) Approximate solution can be exploited 


Disadvantages: (i) Small radius of convergence if large number of unknowns 
(il) Factorisation of J at each iteration is computationally expensive 
(іш) Factorisation of J requires large memory for ІШІ storage 
(iv) Fails to converge if J becomes ill-conditioned 


Table 6.6. Pseudotransient formulation plus approximate factorisation 


Advantages: (i) Each iteration is economical 
(ii) Very large radius of convergence 
{iii} Small memory requirement (splitting uncouples gridlines) 
(iv) Detects if problems not steady 
(v) Approximate solution can be exploited 


Disadvantage: (i) Slow convergence (large number of iterations) 


For viscous flow problems at high Reynolds number (Chaps. 17 and 18) J 
becomes ill-conditioned and this often prevents convergence of Newton's method 
even when starting from a nearby converged solution. 

For some problems it is not known, a priori, if a steady solution exists. By 
constructing an unsteady formulation am oscillatory pseudosteady solution can be 
automatically detected. 

Historically Newton's method has been used more often with finite element 
methods and the pseudotransient splitting approach has been used more often with 
finite difference and finite volume methods. However the pseudotransient approach 
can be interpreted as providing a diagonally augmented system of equations to be 
solved iteratively, as in (6.100). Consequently, even if the augmented equations are 
not formally diagonally dominant (6.52) they are suitable for solution using 
multigrid techniques (Sect. 6.3.5) and usually produce more rapid convergence. 


6.6 Closure 


In this chapter we have briefly reviewed techniques for solving the algebraic 
equations that result from discretisation (Sect. 3.1), particularly for steady prob- 
lems. 
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Тһе algebraic equations produced by discretising the fluid dynamic governing 
equations (Chaps. 14-18) are nonlinear, typically. Consequently some iterative 
procedure is inevitable. It is conceptually useful to think of an outer iteration to 
cope with the nonlinear nature of the discretised equations. At each step of the 
outer iteration a linear system of equations is solved. This system may be solved by 
direct (Sect. 6.2) or iterative methods (Sect. 6.3). 

Newton’s method (Sect. 6.1.1) and the pseudotransient formulation (Sect. 6.4) 
are candidates for the outer iteration. Although Newton’s method (Sect. 6.1.1) 
achieves quadratic convergence when close to the solution, its small radius of 
convergence for large numbers of nodal unknowns makes it a less effective method, 
in its basic form, than the pseudotransient formulation (Sect. 6.4). 

For multidimensional problems the value of split implicit schemes (Sect. 8.2) is 
directly proportional to the efficiency of solving systems of equations that are 
narrowly banded (Sect. 6.2.2-6.2.5) at each stage of the outer iteration. 

Iterative techniques at each stage of the outer iteration are usually most 
effective for systems that are strongly diagonally dominant. This often occurs with 
discretisations of the transonic inviscid flow governing equations (Sect. 14.3) and 
this has led to the development of special iterative techniques (Sect. 14.3.5). How- 
ever a particular feature of the multigrid technique (Sect. 6.3.5) is that it permits 
solution on the coarsest grid by a direct method in an economical manner. Thus 
the multigrid technique is available whether the system of equations is diagonally 
dominant or not. 


6.7 Problems 


Nonlinear Steady Problems (Sect. 6.1) 


61 For the solar collector problem considered in program NEWTON, 
introduce modifications to calculate J in (6.10) only every p steps. Compare 
the number of iterations to convergence and the overall operation count for 
various values of p. It will be more meaningful to start from a solution for 7 9 
that requires a larger number of iterations than in Fig. 6.8 to reach conver- 
gence. 

6.2 Run program NEWTBU for values of Ве= 5, 2, 1. For each case pick the 
value of œw that produces the most rapid convergence. What do you notice 
about the dependence of the number of iterations to convergence on Re? Can 
you correlate this with the relative magnitude of terms in (6.12) and the 
consequent diagonal dominance of J in (6.17)? 

6.3 Modify program NEWTBU so that the solution is chosen from 


4910 = 49 + о, 4464509 ; 


where а», corresponds to the minimum R,,, in the search direction defined by 
44" * U, Compare the number of iterations and CPU time to convergence with 
the results shown in Figs. 6.14 and 6.24. 
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Direct Method for Linear Systems (Sect. 6.2) 


6.4 


6.5 


6.6 


Write a computer program to solve (6.27) with R,,,,=0.5 using subroutines 
BANFAC and BANSOL for р, ј=2,..., 10. Boundary conditions are 
v, =0, v, —1.0. This problem has the exact solution (9.46). 


= —0,0060834 +0.00365 (5/3)/ for ј=1,...,11. (6.101) 


= 


A five-point scheme corresponding to (6.27) is 


(1+ Reen) 05-2802 + Reen)¥j;-1 + 300; — 8(2 — Ree); 41 
+(1—Ry)0;+2=0 . (6.102) 


Modify subroutines BANFAC апа BANSOL to solve the pentadiagonal 
system found from the above equation. Assume sufficient boundary con- 
ditions are available to evaluate vj- z, 0; ү at the left-hand boundary and р, + 1, 
v;+2 at the right-hand boundary. Implement this scheme for j=3,...,9 and 
Roen 1.0. Use 


vj2(e199 —1)(e!9—1) with x;-0.1(j— 1) (6.103) 


to obtain boundary values for v,, v;, Vio and v,,. Equation (6.103) provides 
the exact solution of the problem which (6.102) discretises. Thus the solution 
of (6.102) should be close to (6.103). 

Write a computer program to implement the solution of block tridiagonal 
systems of equations with 2 x 2 blocks. Test by discretising. 

ат „dS d?S dT 

42 -22 =0 and 13 —0.5 dx -0 (6.104) 
for boundary conditions T=0 and S=1 at x 20, T=2sinh(1), S=cosh(1) at 
x= 1 using three-point centred finite difference formulae. Obtain solutions in 
the interval 0<x<1.0 with 4x=0.1. The exact solution to (6.104), and the 
Dirichlet boundary conditions given, is T=2sinh(x), S=cosh(x). 


Iterative Methods (Sect. 6.3) 


6.7 


6.8 


For b/a=3 and 10 obtain solutions to the duct problem (Sect. 6.3.2) corre- 
sponding to Table 6.3 for the Jacobi, Gauss-Seidel and SOR methods applied 
with finite difference discretisation. What is the effect of increasing aspect 
ratio? 

Modify program DUCT to introduce the ADI scheme (6.66, 67) and 
confirm the results shown in Table 6.3. Convergence is often faster if a 
sequence of iteration parameters, 24”, is used. Determine the effectiveness of 
the following choice, for the case b/a=1, Ax = Ay: 


A= 05 | 1= 1 N-1 (6.105) 
> Asin(U5n]/N))  — 7777 i 


6.9 
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where М is the number of divisions on each side of the duct. Equation (6.105) 
gives a cycle, which is repeated if required. The appropriate strategy for 
choosing the iteration parameter sequence is discussed by Varga (1962) and 
Wachpress (1966). 

Apply a F-cycle multigrid method to the duct problem (Sect. 6.3.2) for 
conditions corresponding to Table 6.4 and compare the number of iterations 
to convergence and the overall operation count with the results shown in 
Table 6.4. 


Pseudotransient Methods (Sect. 6.4) | 
6.10 Apply the pseudotransient Newton’s method (program NEWTBU) with J 


6.11 


only evaluated every p iterations. Determine the influence of p on the number 
of iterations to convergence and the overall operation count. 

Modify program NEWTBU so that individual elements of J are set to 
zero if they are less than TOL. For TOL = 107°, 10-3, 10^ !, determine how 
many elements of J are set to zero and the influence on the number of 
iterations to convergence. Would the resulting pattern of non-zero elements in 
J permit a more efficient solution of (6.100) than using subroutines FACT and 
SOLVE; for example using an iterative technique based on (6.51). 


6.12 Can program NEWTBU be made to converge faster if a sequence of At's is 


used? Introduce a geometric cycle of time steps 
At=Atr 1=1,..., р, 


where the geometric ratio, rz: 1.2 and ра 5 to 40. Experiment with different 
combinations of 4/0), r and p. 


7. One-Dimensional Diffusion Equation 


From a computational perspective the diffusion equation contains the same 
dissipative mechanism as is found in flow problems with significant viscous or heat 
conduction effects. Consequently computational techniques that are effective for 
the diffusion equation will provide guidance in choosing appropriate algorithms 
for viscous fluid flow (Chaps. 15~18). 

In this chapter the one-dimensional diffusion equation will be used as a vehicle 
for developing explicit and implicit schemes. Attention will be given to the stability 
and accuracy of the various schemes. The problem of accurately implementing 
boundary and initial conditions will also be considered. 

The one-dimensional diffusion or heat conduction equation 


oF әт 


has already been introduced as a model parabolic partial differential equation 
(Sect.2.3) and used to illustrate the discretisation process (Sect.3.1) and the 
implementation of the finite difference method (Sect. 3.5). 

In (7.1) T may be interpreted as the velocity, vorticity, temperature or concen- 
tration depending on whether the diffusion of momentum, vorticity, heat or mass is 
being considered. If 7 is the temperature, (7.1) governs the flow of heat in a rod 
which is insulated along its edges but can transfer heat to the surroundings via the 
ends of the rod (A and B in Fig. 7.1). 

Two types of boundary condition are common. First the dependent variable is 
a known function of time. In the notation of (7.1) this would be (for the end A) 


T(0,t)=b(t) . (7.2) 


Tol ! 
x 
T or А В Т, or 
A ыы insulotion Аы B 
ТАХ Әх ӘТь,/Әх 
given given Fig. 7.1. One-dimensional, unsteady 


x=0 x=] heat conduction 
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This is a Dirichlet boundary condition (Sect. 2.1.2). In practice b is often a constant. 
Second, the spatial derivative of the dependent variable may be specified. To suit 
(7.1) this could be written (for the end A), 


oT 
— (0, = c(t) . . 
5, (90-сі) (13) 
This is a Neumann boundary condition (Sect. 2.1.2). As with Dirichlet boundary 
conditions, c is often a constant. 

To obtain unique solutions of (7.1) it is also necessary to specify initial 
conditions. These are given by 


Т(х,0)= To(x) . (7.4) 


The exact solution Т(х, t) satisfies (7.1) in conjunction with (7.4) and (7.2) or (7.3) 
applied at x 20 and х= 1.0. 

To obtain the approximate solution, (7.1) is discretised (Sect. 3.1) and the 
resulting algebraic equation 15 manipulated to generate an algorithm. The al- 
gorithm gives the solution at the (n + 1)-th time level (Fig. 3.2) in terms of the known 
solution at the nth and earlier time levels. The overall procedure is described in 
Sect. 3.5. 


7.1 Explicit Methods 


For explicit methods a single unknown, e.g. Т"*!, appears on the left hand side of 
the algebraic formula resulting from discretisation. 


7.1.1 FTCS Scheme 


If a two-point forward difference approximation is introduced for the time de- 
rivative and a three-point centred difference approximation is introduced for the 
spatial derivative in (7.1), the result is 


TT -T) «TLa-OT]é Ths) 


=0. 7.5 
At Ax? 0 (7.5) 


Equation (7.5) will be referred to as the FTCS (forward time centred space) scheme. 
It can be seen that the spatial derivative has been evaluated at the nth time level, i.e. 
at a known time level. Rearranging (7.5) gives the algorithm 


T5*!-sT5 ,«(1-2s) T2 -sT7,1, (7.6) 


where the discretisation parameter s=a At/Ax? . 
The grid points connected together by (7.6) are shown in Fig. 7.2. Substitution 
of Т, the exact solution of (7.1), into (7.5) and expansion of the various terms as а 
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n+] | n+] n+l 
n : n n 


jr joda ja j ja 
ЕТС5 Richordson Du Fort-Fronkel 


1 

i-1 T 

t J) Lineor F.E.M./ 
Crank-Nicolson ЗЕЕ! Cronk-Nicolson 


Fig. 7.2. Active nodes for diffusion equation algebraic schemes 


Taylor series about the ( j, n)-th node (as in Sect. 3.2.1) gives 


Т PTT OX 
ЕЕЕ , 


where the truncation error Е" is given by 


E QT „4х oT 


Е=| Z2 
2 дг “ту дх* 


j 5 | our. Ах“) . (7.7) 
It can be seen that (7.5) is consistent (Sect. 4.2) with (7.1). 

From the leading term іп Е; the FTCS scheme will be referred to as being first- 
order accurate in time and second-order accurate in space. However, it should be 
remembered that this statement is strictly only correct in the limit At, Ax-0. In 
practice, solutions are obtained on a grid of finite spacings and the magnitude of 
terms like 227/01? is not known, a priori. 

Application of the von Neumann stability analysis (Sect. 4.3) indicates that the 
amplification factor G is given by 


9 
G-1-4ssin?( = 
ssin (5) , 


where s=aAt/Ax? апа 6 = тлАх. For any value of 6, |G|<1 if s < 0.5, Thus (7.6) will 
produce stable solutions if s<0.5. It has already been seen (Sect. 4.2.1) that the 
particular choice s= 1/6 introduces a cancellation of terms in the truncation error 
and (7.6) then has a truncation error of O(At?, 4х“). 

The above properties of the FTCS scheme applied to the diffusion equation 
(7.1) are summarised in Table 7.1. It may be noted that the form of the truncation 
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error shown in Table 7.1 is equivalent to (7.7). The accuracy of numerical solutions 
using the FTCS scheme is shown in Table 7.3. 


7.1.2 Richardson and DuFort-Frankel Schemes 


In (7.6) the use of a two-point one-sided difference formula produces a first-order 
contribution to the truncation error and the use of a three-point centred difference 
formula produces a second-order contribution to the truncation error. Therefore a 
logical improvement to (7.6) would be the scheme 


THATS! «(TLa-2TMOT4na _ 


241 Ax? 0, (78) 


due to Richardson (Fig. 7.2). However, although the scheme is of O(At?, Ax?), a von 
Neumann stability analysis (Noye 1983, p. 138) indicates that the scheme is 
unconditionally unstable for s>0. Thus it is of no practical use. It may be noted 
that the unstable behaviour refers to the equation as a whole. When the centred 
difference approximation for the time derivative is introduced into the convection 
equation (9.2) a stable algorithm can be obtained, (9.15). 

The Richardson scheme (7.8) can be modified to produce a stable algorithm. 
This is achieved by replacing 7% in (7.8) with 0.5(T5^ ! -- T5*!). The resulting 
equation is 


2At Ax? 9 (79) 


Equation (7.9), which is known as ће DuFort-Frankel scheme (Fig. 7.2), сап be 
manipulated to give the explicit algorithm 





2 1-2 
шын Ба ИИН арын (7.10) 
1+2s 1+25 


Тһе DuFort-Frankel scheme is three-level in time unless s=0.5, for which it 
coincides with the FTCS scheme. For a three-level scheme, two time-levels of the 
solution must be stored and an alternative two-level scheme is required for the first 
time step. 

Application of the von Neumann stability analysis (Sect. 4.3) to (7.10) produces 
the amplification factor С given in Table 7.1. Since |G| € 1 for any value of 0 with 
s>0, it follows that the DuFort-Frankel scheme is stable for any value of At. There 
is a price to pay for this very favorable stability result. A Taylor expansion of the 
exact solution substituted into (7.9) about the (j,n)-th node produces the result 


Т ёт Гат , 
(4-42 esa) as | ош Ax2)=0 . (1.11) 


Thus for consistency At/Ax must -+0 as At, Ах->0, i.e. it is required that At < Ax 
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for consistency. However «(4t/4x)? = 541 and we expect s to be of O(1) for diffusion 
problems. Therefore the DuFort-Frankel scheme is consistent with (7.1) but will be 
inaccurate if sAt is large. The alternative form of the truncation error (Table 7.1) 
indicates that if s = (1/12)1/2, the DuFort-Frankel scheme has a truncation error of 
O(Ax*). A corresponding solution accuracy is indicated in Table 7.3. 

From a practical point of view there is still an effective restriction on the size of 
At when using the DuFort-Frankel scheme, even though it arises from an accuracy 
restriction rather than a stability restriction, as with the FTCS scheme. 


7.13 Three-Level Scheme 


A general explicit three-level discretisation of (7.1) can be written as 

aT" * +ЬТ"-еТт 1-(а1,,ТҢ +eLe T *)=0, о (7.12) 
where | 

Lex T= (Tj-1—2T;+ Ty 1)/ 4x? . 


The parameters a, b, c, d and e may be determined by expanding each term in (7.12) 
as a Taylor series about node (j, п) and requiring that (7.12) is consistent with (7.1). 
Examples of this procedure are provided in Sects. 3.2.2 and 3.2.3. This procedure 
permits (7.12) to be rewritten with only two disposable constants, y and В, instead 
of five. Thus (7.12) is replaced by 


49505*!-T)) v(05-Tj) 


— — n "-11-- 
At At a[l P) Lax Ti+ ВТ ] 0. (7.13) 


A Taylor series expansion of (7.13) about node ( j, n) indicates consistency with (7.1) 
and a truncation error given by 


“Т 1 
Еа (058-1) Ох) , (7.14) 


where s—«At/Ax?. In (7.14) all time derivatives have been replaced with spatial 
derivatives, using the governing equation, as in Table 7.1. 
Clearly (7.13) has a truncation error of fourth order if f is given by 


1 
--05-у---. 7.15 
95-2 125 ( ) 


Equation (7.13) produces the algorithm 


1+2у y _ $ _ 
nti. | Te n п-і г E n Т" 1 , 7.16 
ЖЕСЕ 


where І,,Т,-1,. 1 -21;- Тр: 
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Equation (7.16) turns out to be only conditionally stable, with the maximum 
value of s for stability being a function of y. This can be established by applying the 
von Neumann stability analysis (Sect. 4.3) to (7.13). This requires solution of the 
following quadratic equation for G: 


с2(1-)-СП + 2y+2s(1 — f) (cos8 — 1)}+ [y —28s(cos0—1)]20 . (7.17) 


For stability it is necessary that |G| €1 for all values of 8. This generates the 
stability map as a function of y and s shown in Fig. 7.3. At y —0, (7.17) has a more 
restrictive stability limitation оп s (< 0.34) than the FTCS scheme. For very large 
values of y the stability limitation becomes sx 0.5, the same as for the FTCS 
scheme. The accuracy of the three-level scheme (7.13) is examined in Sect. 7.1.4. 


050 UNSTABLE 


0.40 STABLE 


0 2.0 40 60 Fig. 7.3. Stability map for (7.13 
Ү апа 15) 


7.1.4 DIFEX: Numerical Results for Explicit Schemes 


In this section the FTCS scheme (Sect. 7.1.1), the DuFort-Frankel scheme 
(Sect. 7.1.2) are compared with the three-level scheme (Sect. 7.1.3). All three 
methods are incorporated into program DIFEX (Fig. 7.4), which is an exten- 
sion of DIFF (Fig. 3.13). 

Solutions are sought in the computational domain Ox xx1.0 and 2.00xt 
59.00, with initial conditions given at t —2.00 by the exact solution (3.42) divided 
by 100. Boundary conditions are 


T(0,)=T(1,t)=1.0 . (7.18) 


The accuracy of the various schemes is assessed by evaluating the rms difference 
between the computed and exact solutions at T=9.00. The exact solution is 
computed in subroutine EXTRA (Fig. 7.5). For the DuFort-Frankel scheme 
and the three-level scheme two levels of initial data are required at t=2.00 апа 
t—2.00— At. 

The major parameters used by program DIFEX (Fig. 7.4) are defined in 
Table 7.2. A typical solution, produced by the three-level scheme on a relatively 
coarse grid, is shown in Fig. 7.6. 
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DIFEX SOLVES THE DIFFUSION (1D TRANSIENT HEAT CONDUCTION) 
EQUATION USING VARIOUS EXPLICIT SCHEMES 


DIMENSION TW(41),DUM(41),TD(41), TE(41) , TOL(41) , EL(3) 
DOUBLE PRECISION SUM,AVS,RMS,DSQRT,DMP 


OPEN (1,FILEs'DIFEX.DAT') 

OPEN(6,FILEs 'DIFEX.OUT') 

READ(1, 1)ME, IPR, JMAX, MAXEX , ИМАХ, ALPE, S, ТМАХ , TST, САМ 
1 FORMAT(5SIS,E10.3,F5.3,3F5.2) 


PI x 3.1415927 

JMAP = МАХ - 1 

AJM = ЛАР 

DELX = 1./AJM 

DELT = DELI*DELX*S/ALPH 


IF(ME .£Q. 1)WRITE(6,2) 
IF(ME .Е9. 2)WRITE(6,3) 
IF(ME .EQ. 3)WRITE(6,4) 

2 FORMAT(' FTCS SCHEME’ ,//) 

3 FORMAT(' DUFORT-FRANKEL SCHEME’ ,//) 

4 FORMAT(’ 3-LEVEL, 4TN-ORDER SCHEME’ ,//) 
VRITE(6,5) JMAX ,MAXEX, ИМАХ, THAX, TST, САИ 

5 FORMAT(' JMAX=’,IS,’ MAXEX=',IS,’ MMAX=’,I5, 
1’ TMAI-^,F5.2,' TST-',F5.2,' GaM=’,FS.2) 
WRITE(6,6)S, ALPH , DELT, DELI 

6 FORMAT(’ Ss',F5.3,' ALPE = ',E10.3,' DELT = ',Е10.3, 
1' DELI = ',E10.3,//) 


IF(ME .Е0. 2)AS = 2.*S/(1.*2.*S) 
IF(ME .Е0. 2)BS = (1. - 2.*S)/(1. + 2.*S) 
IF(ME .NE. 3)GOTO 7 
AS = (1.*2.*GAM) / (1. *GAM) 
BS = -GAM/(1.*GAM) 
CS = S/(1.*GAM) 
DS = 1.5 + САМ ~ 1./12./$ 
ES s -0.5 - GAM * 1./12./S 
EL(1) = 1. 
EL(2) = -2. 
EL(3) = 1. 
7T CONTINUE 


OBTAIN INITIAL CONDITIONS FROM EXACT SOLUTION 


T = TST - 2.*DELT 
DO 9 I = 1,2 
T =T + DELT 


CALL EXTRA(JMAX,MAYEX,DELI,PI,ALPH,T,TE) 


DO 8 J - 2,JMAP 
IF(I .Eg. 1)TOL(J) = TE(J)/100. 
IF(I .£g. 2)TN(J) = TE(J)/100. 
8 CONTINUE 
9 CONTINUE 


SET BOUNDARY CONDITIONS 


Fig. 7.4. Listing of program DIFEX 
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12 


13 
14 


15 


16 


17 


18 


19 


20 


21 


22 


23 


24 


TOL(1) = 1. 

TOL(JMAX) = 1. 

TN(1) = 1. 

ТИ(ЈМАХ) = 1. 

Тр(1) = 100. *TN(1) 
TD(JMAX) = 100.*TN(JMAX) 
rea 


EACH TIME STEP STARTS AT STATEMENT 10 


N31 
IF(ME .Е0. 176070 15 
IF(ME .Е0. 276070 13 


ME = 3, 3-LEV., 4TH ORDER 


DO 12 J =2, ЛАР 

DUM(J) s AS*TN(J) * BS*TOL(J) 
DO 11 К = 1,3 

KJ =J-2+K 


DUM(J) = DUM(J) + CS*EL(K)*(DS*TN(KJ) + ES*TOL(KJ)) 


CONTINUE 

CONTINUE 

GOTO 17 

ME = 2, DUFORT-FRANKEL 

DO 14 J = 2,JMAP 

DUM(J) = AS*(TN(J-1) + TH(J*1)) + BS*TOL(J) 
GOTO 17 

ME = 1, FTCS 


DO 16 J = 2, МАР 


DUM(J) = (1.-2.*S)*TN(J) + $+(ТН(1-1) + TH(J+1)) 


CONTINUE 

DO 18 J = 2,JMAP 

IF(ME „СТ. 1)TOL(J) = TN(J) 
TN(J) = DUM(J) 


DO 19 J - 2,JMAP 

TDCI) = 100, *TN(J) 

T= T + DELT 

IF(IPR :EQ. 1)WRITE(6,20)T, (TO(J2, J=1, МАГ) 
FORMAT(' T= ',F5.2,' TDz',11F6.2) 


* 


IF MAXIMUM TIME OR MAXIMUM NUMBER OF TIME-STEPS 


EXCEEDED EXIT FROM LOOP 


IF(N .GE. NMAX)GOTO 21 
IF(T .LT. TMAX)GOTO 10 


OBTAIN EXACT SOLUTION AND COMPARE 
SUM = 0. 
CALL EXTRACJMAX,MAXEX,DELX,PI,ALPE,T,TE) 


DO 22 J = 1,JMAX 

DMP = ТЕС)) - TDCI) 

SUM = SUM + DMP*DMP 

CONTINUE 

IF(IPR .ШЕ. 1)WRITE(6,20)T, (TD(J) , J21, JMAX) 
VRITE(6,23)T, (TE(J) , J=1, JMAX) 

FORMAT(/,' T= ?,F5.2,' TEs',11F6.2,//) 


RMS IS TNE RMS ERROR 


AVS * SUM/(1. * AJM) 

RMS = DSQRT(AVS) 
WRITE(6,24)RMS 

FORMAT(' RMS DIF = ',D11.4,//) 
STOP 

EID 


Fig. 7.4. (cont) Listing of program DIFEX 
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1 
2 SUBROUTINE EXTRA(JMAI,MAXEX,DELI,PI,ALPH,T,TE) 
3c 
4c EXACT SOLUTION OF THE TRANSIENT HEAT CONDUCTION PROBLEM 
5c 
6 DIMENSION X(41),TE(41) 
тс 
8 ро 2 J = 1,JMAž 
9 417-1-1 
10 X(J) = DELI*AJ 
11 TE(J) s 100. 
12 DO 1 M = 1,MAXEX 
13 AM = M 
14 DAM = (2.46 - 1.) 
15 DXN = DAM*PI*I(J) 
16 DTM = -ALPH*DAM*DAM*PISPIST 
17 С 
18 C LIMIT ТИЕ ARGUMENT SIZE OF EXP(DTM) 
19 C 
20 IF(DTM .LT. - 25.)DTM = -25.0 
21 DTM - EXP(DTM) 
22 IF(DTM .LT. 1.0E-10)GOTO 2 
23 1 TE(J) = TE(J) - 400. /DAM/PI*SIN(DXM)*DTM 
24 2 CONTINUE 
25 RETURN 
26 EXD Fig. 7.5. Listing of subroutine EXTRA 


Table 7.2. Parameters used by program DIFEX 





Parameter Description 





ME - 1, FTCS scheme 
-2, DUFORT-FRANKEL scheme 
=3, 3-level, fourth-order scheme 


IPR controls output; IPR =1 for TD solution 
JMAX number of points in the x direction 

МАХЕХ number of terms іп the exact solution 

NMAX maximum number of time-steps 

ALPH thermal diffusivity, « 

$ = gåt/ dx? 

TMAX maximum time 

TST initial time 

GAM parameter y in (7.16) 

TD dimensional temperature array 

ТМ nondimensional temperature array, Т" іп (7.16) 
TOL nondimensional temperature array, T"~! in (7.16) 
TE exact (dimensional) temperature array 

DELX Ax 

DELT At 

EL coeficients in the difference operator Li. 

T time 

X locetion 0S x $1.0 


RMS rms error in the solution 
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3-LEVEL, 4TH-ORDER SCHEME 


«МАХ» 11 КАХЕХ= 10 NMAX* 500 ТИАХ= 9.00 TST= 2.00 GAM= 0.00 
S= .300 ALPH =  .100E-01] DELT = .300Е%00 DELX = .100Е+00 


T= 9.20 TD=100.00 84.12 69.80 58.45 51.16 48.65 51.16 58.45 69.80 84.12100.00 


Тя 9.20 ТЕ=100.00 84.12 69.80 58.45 51,17 48.66 51,17 58.45 69.80 84.12100.00 
RMS DIF = .4169D-02 Fig. 7.6. Typical output produced by program DIFEX 


The accuracy of various solutions, using program DIFEX, is compared in 
Table 7.3. Two values of s=0.3 and 0.41 have been used to allow the three-level 
fourth-order scheme (7.13) to be compared with the FTCS and DuFort-Frankel 
schemes. In addition the two special cases, FTCS with s=1/6 and DuFort-Frankel 
with 5--(1/12) 42 are included. Both of these cases correspond to truncation error 
cancellation. 


Table 7.3. Computational solutions of (7.1) using explicit schemes 


approx. 
RMS RMS RMS conv. 
Case 5 7 (Ах = 0.2) (dx =0.1) (4х =0.05) rate, r 
FTCS 1/6 0.7266 x 10-2 0.4921 x 10-3 0.3279 x 10-* 3,9 
ЕТС$ 0.30 0.6439 0.1634 0.0413 2.9 
FTCS 0.41 1.2440 0.3023 0.0755 2.0 
D.-F. 0.289 0.0498 0.2328 x 107? 0.1152 x 10-3 4.3 
р.-Е. 0.30 0.0244 0.0136 0.00395 1.8 
D.-F. 0.41 0.8481 0,2085 0,05250 20 
3L-4TH 030 0.0 0.0711 0.00416 0.00022 4.2 
3L-4TH 030 0.5 0.1372 0.00665 0.00029 4.5 
3L-4TH 0.30 1.0 0,2332 0.00916 0.00054 41 
3L-4TH 9041 10 0.7347 0.0229 0.00140 4.0 


At s=0.3 the FTCS scheme demonstrates а second-order convergence rate, Тһе 
approximate convergence rate has been obtained from the ratio of the rms error on 
the 4x —0.1 and 0.05 grids, i.e. 


r =log{RMS,, - о.1/ М5 = о.05)/108(2) 


The DuFort-Frankel scheme is considerably more accurate than the FTCS 
scheme, This is related to the proximity of the special value, s =1/(12)1/2 =0.289, for 
which a fourth-order rate of convergence is achieved. The three-level fourth-order 
scheme (with y=0) is less accurate than the DuFort-Frankel scheme on a coarse 
grid (4x 20.20). The tendency for higher-order schemes to require a finer grid to 
demonstrate superior accuracy is common. As y is increased the accuracy of the 
three-level fourth-order scheme on a given grid reduces although a fourth-order 
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rate of convergence is maintained, approximately. For s=0.41 and y=1.0 the 
three-level fourth-order scheme demonstrates comparable accuracy to the 
DuFort-Frankel scheme on a coarse grid but is substantially more accurate on a 
fine grid. 

The results shown in Table 7.3 indicate that it is straightforward to construct 
higher-order schemes by truncation error cancellation. Such schemes, typically, 
produce higher accuracy on a fine grid, but often with more severe stability 
restrictions. The degree of improvement in accuracy may not be as great for the 
nonlinear equations governing fluid motion as for the diffusion equation, which has 
a very smooth solution in the present situation. 


72 Implicit Methods 


For implicit schemes the spatial term 22Т/дх2 in (7.1) is evaluated, at least 
partially, at the unknown time level (n+ 1). In practice this leads to a coupling of 
the equations for each node (j,n+1) at the (n+ 1)-th time level, and the need to 
solve a system of algebraic equations to advance the solution. 


7.2.1 Fully Implicit Scheme 
The simplest implicit finite difference representation for (7.1) is 


(T3+1—- T3) щ(Т{1}—2Т]'1+Т]{1) 


-0. 4 
At Ax? (7.19) 


To generate a useful algorithm (7.19) is rewritten as 


—5Т1}+(1+25)Т111—5Т11}=Т1. (7.20) 


А Taylor expansion about the (j,n)-th node indicates that this scheme has а 
truncation error 


2Т 
g-- 5 (ie ЕЗІ + O(At?, Ax*) . 


This is the same order as for the explicit (ЕТС) scheme, (7.5) with 5 # 1/6, although 
the multiplying constant is larger. 

Application of the von Neumann stability analysis (Sect. 4.3) produces the 
following expression for the amplification factor: 


G-[1-2s(1—cos80)] ! . 


For any choice of 0, |G| € 1 if s 0. That is, (7.20) is unconditionally stable. This is 
clearly an improvement over the conditionally stable explicit scheme (7.5). 
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However, to solve (7.20) it is necessary to consider all the nodes j and the 
corresponding equations. Thus a matrix of equations can be written for the 
unknown values 7751; 


(1-2) -в T? 4, 
-8 (1+2) -в Тї! d, 
... ... ... ||... 121 
—s (1+2 -s Tt! d; (7.21) 


-s (1429! Ire] |а, 
In (7.21) 
d,=T3+sTy"! , 
4,= Тї, d;-,=T"%_, +5757) 3 


where 7771 and T5*! are known from the Dirichlet boundary conditions. It is 
apparent that the system of equations is tridiagonal. Consequently the Thomas 
algorithm (Sect. 6.2.2) can be used to solve (7.21) in 5(J —2) — 4 operations (only 
multiplications and divisions are counted). 

In practice, allowing for the setting up of the equations, the solution of the 
implicit system of equations (7.21) via the Thomas algorithm, requires twice as 
much computer time, typically, as solving the same number of explicit equations 
(7.6). The time-step can be made considerably larger than the limiting explicit time- 
step, At,,,=0.54x7/a; however, then the accuracy of the solution will be less. 


7.2.2 Crank-Nicolson Scheme 


An alternative implicit algorithm for (7.1) is provided by the Crank-Nicolson 
scheme (Fig. 7.2) which 15 


(тр 


qp o5 «5L, T) -065L, T)*1)-0 , (1.22) 


where 


T; .1—2T;4 T, 
L т= 1 jtijels 
«T Ax? 
Effectively this scheme evaluates the spatial derivative at the average of the nth and 
(n+ 1)-th time levels, i.e. at the (n+ 1/2)-th time level. If a Taylor expansion is made 
about (j, n + 1/2), (7.22) is found to be consistent with (7.1) with a truncation error of 
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O(AC, Ax?). This is a considerable improvement over the fully implicit and FTCS 
schemes that are only first-order accurate in time. 

А von Neumann stability analysis indicates that the Crank-Nicolson scheme is 
unconditionally stable, Table 7.1. A rearrangement of (7.22) gives the algorithm 


— 0.55 Т"+1+(1+5) Т"*1 -O5s T7121 
=0.5s Т"_,+(1-9Т"+0.55 T7, , (7.23) 


which may be compared with (7.20) By considering all spatial nodes (7.23) 
produces a tridiagonal system of equations which can be solved efficiently using the 
Thomas algorithm. 

Because of the second-order temporal accuracy, the Crank-Nicolson scheme is 
a very popular method for solving parabolic partial differential equations. The 
properties of the Crank-Nicolson scheme are summarised in Table 7.1. 

A generalisation of (7.22) can be obtained by writing 


атты 


At —a[(1—8) Lyx T7+BL,.T5**J=0, (7.24) 


where 47351-787%1--7% and 05851. If В=0 the FTCS scheme is obtained. 
If B=0.5 the Crank-Nicolson scheme is obtained and if B=1.0 the fully implicit 
Scheme is obtained. 

А von Neumann stability analysis of (7.24) indicates that a stable solution is 
possible for 


0.5 Ax? 
<-- if 0x 1 
45 7 3B if 0€ B<1/2 
no restriction if 1258351. 


It may be noted that the Crank-Nicolson scheme is on the boundary of the 
unconditionally stable regime. For many steady flow problems it is efficient to 
solve an equivalent transient problem until the solution no longer changes 
(Sect. 6.4. However, often the solution in different parts of the computational 
domain approaches the steady-state solution at significantly different rates; the 
equations are then said to be stiff (Sect. 7.4). Unfortunately the Crank-Nicolson 
scheme often produces an oscillatory solution in this situation which, although 
stable, does not approach the steady state rapidly. Certain three-level (in time) 
schemes are more effective than the Crank-Nicolson scheme in this regard. 


7.33 Generalised Three-Level Scheme 


For the diffusion equation, a generalised three-level scheme that includes (7.24) can 
be written 
(947)! у47) | 
At 


At a [(1 — $) L,, T5 -BL,,T5* !]-20 ry (7.25) 
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where 47"-7%-7%7!, The inclusion of the extra time-level implies a larger 
memory requirement to store the solution. However, the modern trend is for 
computer memories to become cheaper and larger. A second effect is that ad- 
ditional execution time is required, typically 10%-15%, to manipulate the ad- 
ditional terms. 

A particularly effective three-level scheme is given by the choice: y 20.5, В = 1.0. 
This scheme has а truncation error of O(At?, 4х2), is unconditionally stable, can be 
solved using the Thomas algorithm and damps out the spurious oscillations, 
discussed above in relation to stiff problems. We will refer to this scheme as. the 
three-level fully implicit (3LFI) scheme and will make further use of it when 
discussing approximate factorisation (Sects. 8.2 and 8.3). The 3LFI scheme has the 
useful feature of being A-stable (Sect. 7.4). 

The properties of some of the various numerical schemes for representing the 
diffusion equation (7.1) are shown in Table 7.1. Many more schemes are given by 
Richtmyer and Morton (1967, p. 189). 


7.24 Higher-Order Schemes 


The starting point for this section will be the discretised equation, (7.25), modified 
to embrace both finite difference and finite element three-level schemes. Thus (7.25) 
is replaced by 





—a[fL,T'*'-(1-8)L,T7]-0 , (726 


Ату! TM, AT 
At At 


ағум4 


where 
AT; = ТТ" , AT;-T;-T3 | , and 


 Tj-1-2T;t Tj, 
4 Ах2 . 


The similarity in the structure of (7.26) and (7.13) is noteworthy. For the finite 
element method, M, = (1, 4, 1), so that 


МАТ, = АТ, , +447; ЖАТ. | (727) 


For the finite difference method, М„= {0,1,0}. The parameters у апа В may be 
chosen to provide particular levels of accuracy and/or stability. In Sect. 7.2.2 the 
various schemes correspond to y=0. The particular choice y=0, B=0.5 gives the 
Crank-Nicolson method. Results for both finite element and finite difference forms 
of the Crank-Nicolson method are provided in Table 7.3. The choice y =0.5, B= 1.0 
is discussed, briefly, in Sect. 7.2.3. In this section y will be treated as a free parameter 
but В will be treated as a function of y. 


7.2 Implicit Methods 231 


A Taylor series expansion of (7.26) about node (j,n) produces the following 
expression for the truncation error leading term: 
(| 6-і 


Ej=asdx? =z ( 05+у+—11- в) + O(Ax*) , (7.28) 


where the mass operator is written as 
M, = {6,1 —26,5} . (7.29) 


This includes both the finite difference (6=0) and finite element (6=#) formu- 
lations. The form of the truncation error (7.28) has eliminated all time derivatives, 
as in Table 7.1. 

The schemes considered previously (Sect. 7.2.3, etc.) have corresponded to the 
choice f =0.5 + у. However, it is clear that fourth-order accuracy should be possible 
for the choice 


— 


i 
н. (7.30) 





В=0.5+у+ 


In turn this motivates the choice M, = (i5, à, 15), since this will produce a fourth- 
order truncation error with B=0.5 +y. 

Equation (7.26) is applied at every node producing a tridiagonal system of 
equations of the form 


A, Tht + В, TH+ СТ =(1 + 2у) M, 7%-уМ, Th c (1-f)sL.. T] ‚- 
7.31 
where ( 


А,-Сұ-(1-у)4-88, В;=(1+7)(1— 25) + 258 and 
І,,Тң-Т!-1-278-47%1. 


Although the solution of (7.31) requires a higher operation count than the fully 
implicit or Crank-Nicolson schemes it is considerably more accurate (Sect. 7.2.5). 


72.5 DIFIM: Numerical Results for Implicit Schemes 


In this section the accuracy of the higher-order scheme (7.31) is compared with the 
accuracy of the lower-order implicit schemes and the low and high-order explicit 
schemes of Sect. 7.1. 

Program DIFIM is an extension of program DIFF to obtain the compu- 
tational solution of (7.1) subject to the boundary conditions (7.18) by repeatedly 
solving (7.31) to advance the solution in time for all interior nodes. The solution of 
(7.31) is undertaken by subroutines BANFAC and BANSOL. Since A,, В; and С; 
are independent of time it is only necessary to call BANFAC once, at the first time- 
step. A listing of program DIFIM is provided in Fig. 7.7. Program DIFIM can 
invoke five options as shown in Table 7.4, which correspond to different choices of 
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1 

2c 

3c DIFIM.FOR SOLVES THE DIFFUSION EQUATION USING 
4c VARIOUS IMPLICIT SCHEMES 

sc 

6 DOUBLE PRECISION SUM,AVS,RMS,DSQRT,DMP 

7 DIMENSION DUM(65),TE(41),A(5,65),D(65) 

8 DIMENSION ELI(3),EMX(3),TOL(41),TE(41) , TO(41) 
9c 

10 OPEN(1,FILE=’DIFIM. DAT’) 

11 OPEN (6 ,FILE=’*DIFIM. OUT’) 

12 READ(1, 1) ME, IPR,JMAX,MAXEX,NMAX,ALPE,S,TMAX,TST, САИ 
13 1 FORMAT(SIS,E10.3,F5. 3,3F5.2) 

14 C 

15 PI - 3.1415927 

16 JMAP = ЈМАХ - 1 

17 АМ = ОМАР 

18 DELX = 1./AJM 

18 DELT = DELI*DELK*S/ ALP 
20 C 
21 IF(ME .EQ. 1)WRITE(6,4) 
22 IF(ME .EQ. 2)WRITE(6,5) 
23 IF(ME .EQ. 3)WRITE(6,6) 
24 IF(ME .EQ. 4)WRITE(6,7) 
25 ТЕ(МЕ .EQ. 5)WRITE(6,8) 
26 WRITE(6 , 2) JMAX ,MAXEX ,MMAX , TMAX , TST, САМ 
27 2 FORMAT(' ЗМАХ=?,15,’ MAXEX=',I5,' ШМАХ=',15, 


1’ ТВАХ=’,Р8.2,’ TST-',F5.2,' GAM-',F5.2) 
WRITE(6 , 3)S, ALPH, DELT,DELX 


1: DELI = ',E10.3.//) 


4 FORMAT( 


+ 


DI 


FFUSION EQUATION:  FDM-2JD',//) 


28 
29 
30 3 FORMAT(' S= ',F5.3,' ALPH = ',Е10.3,’ DELT = 'E10.3, 
31 
32 
33 


5 FORMAT( 


+ 


DI 


FFUSION EQUATION:  FEM-2J4D',//) 


34 6 FORMAT(' DIFFUSION EQUATION:  FDM-4TR',//) 
35 7 FORMAT(’ DIFFUSION EQUATION:  FEM-4TM',//) 
36 8 FORMAT(' DIFFUSION EQUATION:  COMP',//) 

31 с 

38 C IMPLICIT PARAMETERS 

39 C 

40 BET = 0.5 + САМ 

41 IF(ME .EQ. 3)BET = BET - 1./12./5 

42 IF(ME .EQ. 4)BET - BET * 1./12./S 

43 IF(ME .EQ. 1 .OR. ME „EQ. 3)EMI(1) = O, 
44 IF(ME .EQ. 2 .OR. ME .EQ. 4)ЕМХ(1) = 1./6, 
45 IF(ME .EQ. S)EMX(1) = 1./12. 

46 ЕМХ(2) = 1. - 2.ФЕМХ(1) 

47 ЕМХ(3) = EMI(1) 

48 AD = EMX(1)*(1. + САМ) - ВЕТ*$ 

49 BD = ENX(2)*(1. + САМ) + 2.*BET#S 

50 CD = aD " 

51 ELY(1) = 1. 

52 ELI(2) = -2. 

53 ELI(3) = 1. 

54 JMAF = JMAX - 2 

55 C 


56 С ОВТАТЕ 


INITIAL CONDITIONS FROM THE EXACT SOLUTION 


Fig. 7.7. Listing of program DIFIM 


B as a function of y and different choices for M,. Case 1 with y=0 corresponds to 
the Crank-Nicolson scheme (7.26). Many of the major parameters used by 
program DIFIM are the same as used in program DIFEX (Table 7.1). Ad- 
ditional parameters are shown in Table 7.5. A typical solution corresponding to 


ME=3 and y=0 is shown іп Fig. 7.8. 
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109 
110 
111 
112 
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114 
115 
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10 


11 


12 


13 
14 


15 


16 


17 


7.2 


T = TST - 2.*DELT 
DO 10 I - 1,2 
T= T + DELT 


CALL EXTRA (JMAX,MAXEX,DELI,PI,ALPH,T,TE) 


00 9 J = 2,JMAP 

IF(I .EQ. 1)TOL(J) = TE(J)/100. 
IF(I .EQ. 2)TN(J) = ТЕ(Ј)/100. 
CONTINUE 

CONTINUE 


SET BOUNDARY CONDITIONS 


TOL(1) = 1. 

ТОГ(ЈМАХ) = 1. 

TN(1) = 1. 

TN(JMAI) = 1. 

TO(1) = 100.*TH(1) 
TO(JMAI) = 100.*TN(JMAI) 
reo 


EACH TIME STEP STARTS AT STATEMERT 11 


SET UP TME TRIDIAGONAL SYSTEM OF EQUATIONS 


DO 14 J = 2,JMAP 
JM42J-1 

ІҒ(Е .СТ. 1)60TO 12 
А(1,Л0 
AC2, JM) 
АС3,1М) 
ACA, JM) 
AC5, JM) 
D(JM) = 
DO 13 K = 1,3 

KJ2J-2*K 

D(JM) = DCJM) + ЕМХ(К)*((1. + 2. *GAM) *TE(KJ) - GAM*TOL(KJ)) 
D(JM) = D(JM) + S*ELX(K)*(1.-BET) *TH(KJ) 

CONTINUE 

CONTINUE 

D(1) = D(1) - A(2,1)*TI(1) 

D(JMAF) = D(JMAF) - А(-:, МАР) #ТИСЈМАХ) 


298552 


"n оп AN 


SOLVE BANDED SYSTEM OF EQUATIONS 
IF(N .EQ. 1)CALL BANFAC(A,JMAF 1) 
CALL BANSOL(D,DUM,AÀ, ЈМАҒ, 1) 


00 15 J = 2, ЛАР 
TOL(J) = THCJ) 


ТЕС) = DUM(J-1) 
DO 16 J = 2, ЛАР 
ТОС) = 100,*THCJ) 
T = T + DELT 


IF(IPR .EQ. 1)WRITE(6, 17) T, (TO(2) ,J=1, JMAX) 
FORMAT(! T= ',F5.2,? TO=’ ,11F6.2) 


Fig. 7.7. (cont.) Listing of program DIFIM 


Implicit Methods 


233 
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119 С 
120 С IF MAXIMUM TIME OR MAXIMUM WUMBER OF TIME-STEPS EXCEEDED 
121 С EXIT FROM LOOP 

122 С 

123 IF(N .GE. NMAX)GOTO 18 

124 IF(T .LT. TMAX)GOTO 11 

125 C 

126 C OBTAIN EXACT SOLUTION AND COMPARE 

127 C 

128 18 CALL EXTRA(JMAI,MAIEI,DELX,PI,ALPB,T,TE) 
129 C 

130 SUM - 0. 

131 DO 19 J = 1,JMAX 

132 DMP = TE(J) - TOÇI) 

133 SUM = SUM + DMP*DMP 

134 19 CONTINUE 

135 IF(IPR .ШЕ. 1) WRITE(6,17)T,(TO(J),J=1, JMAX) 
136 VRITE(6,20)T. (TE(J) , J=4, JMAX) 

137 20 FORMAT(/,' T= ',F5.2,' TE-',11F6.2,//) 

138 C 

139 С RMS IS THE RMS ERROR 

140 C 

141 AVS = SUM/(1. + AJM) 

142 RMS = DSQRT(AVS) 

143 WRITE(6,21)RMS 

144 21 FORMAT(? RMS DIF = ',D11.4,//) 

145 STOP 

146 END 


Fig. 7.7. (cont) Listing of program DIFIM 


Table 7.4. Different options implemented in program DIFIM 


Case (ME) Descriptor M, В 

1 FDM-2ND огдег (0, 1, 0) 0.5 +7 

2 FEM-2ND order (1/6, 2/3, 1/6) 0.5+7 

3 FDM-4TH order (0,1,0) 0.5 +y~1/(12s) 
4 FEM-4TH order (1/6, 2/3, 1/6) 0.5 ++y+1/(12s) 
5 COMPosite (1/12, 5/6, 1/12) 0.5 +7 


Table 7.5. Additional parameters used т program DIFIM 





Parameter Description 

ME parameter controliing choice of method (Table 7.4) 
GAM parameter y used in (7.26) 

BET parameter f used in (7.26) and Table 7.4 

EMX mass operator, М, 

ELX difference operator, Li, 


AD, BD, CD А, В, and C, used in (7.31) 
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DIFFUSION EQUATION:  FEM-4TH 


ЧИАХ= 11 МАХЕХ= 10 НИАХ= 500 TMAX= 12.00 TST= 4.50 GAM= .00 
S= 1.000 ALPH = .100Е-01 DELT =  .100E*01 DELX = > .100Е%00 


T= 12.50 TDx100.00 88.55 78.22 70.02 64.76 62.94 64.76 70.02 78.22 88.55100.00 


Тж 12.50 ТЕз100.00 88,54 78.21 70.00 64.74 62.92 64.74 70.00 78.21 88.54100.00 


RMS DIF = .15220-01 Fig. 7.8. Typical output produced by program DIFIM 


Table 7.6. Accuracy of the implicit schemes (Tabie 7.4) when solving (7.1) 


Case RMS RMS RMS . approx conv. 
(ME) y (4х=0.2) (4х=0.1) (4x = 0.05) rate, ғ 

1 0 0.3895 0.1466 0.03993 1.9 

2 0. 0.8938 0.1787 0.04185 2.1 

3 0. 0.2393 0.01526 0.001053 3.9 

4 0 0.2393 0.01522 0.000897 4.1 

5 0 0.2393 0.01525 0.001034 3.9 

1 10 2.090 0.03003 0.03245 3.0“ 

2 10 1.760 0.2475 0.04668 2.4 

3 10 2.367 0.1246 0.008129 3.9 

4 1.0 1.395 0.09269 0.005912 40 

5 1.0 1.867 0.1087 0.007097 3.9 7 


* Based on RMS,, ..92/RMS4, 2005 


Program DIFIM has been used to compare the various methods shown in 
Table 7.4. Comparative results are presented in Table 7.6 for the solution accuracy 
on various grids (4x =0.2, 0.1 and 0.05) at t= 12.0. These results were obtained with 
& —0.01 and the initial condition defined by the exact solution (3.42) at t=4.5. 
Dirichlet boundary conditions have been imposed on Т at х=0 and 1.0. The 
approximate convergence raté has been obtained as in Table 7.3. 

All the results presented in Table 7.6 have been obtained at s=1.0. The 
nominally second-order methods (cases 1 and 2) are showing approximately 
second-order convergence and the nominally fourth-order methods (cases 3, 4 and 
5) are showing fourth-order convergence. This is occurring at both y 20 and y=1. 

Generally, increasing y reduces the level of accuracy while maintaining ap- 
proximately the same convergence rate. For coarse grids the accuracy demon- 
strated by the higher-order schemes (cases 3-5) is not significantly higher than that 
demonstrated by the lower-order schemes (cases 1 and 2), particularly at large y. 
However on a fine grid the difference in accuracy is substantial. Cases 3, 4 and 5 
show comparable accuracy on coarse and fine grids at y — 0. However at y = 1, case 
4 is more accurate than cases 3 or 5, particularly on a coarse grid. 


236 7. One-Dimensional Diffusion Equation 


Table 7.7. Comparison of implicit and explicit schemes, ;=0.41 


RMS RMS RMS Approx conv. 
Case y (4x 20.2) (4x — 0.1) (4х= 0.05) rate, г 
FDM-2ND, imp 0. 0.7502 0.2004 0.05127 2.0 
FDM-4TH, imp 0. 0.03718 0.002407 0.000206 3.5 
FTCS, exp 0. 1.2440 0.30230 0.07550 2.0 
FDM-2ND, imp 1. 0.7625 0.1681 0.04938 1.8 
FDM-4TH, imp 1. 0.6482 0.03155 0.00192 4.0 
FDM-4TH, exp 1, 0.7347 0.02290 0.00140 4.0 





Some of the implicit and explicit schemes are compared in Table 7.7, for both 
y=0 and y= 1.0 with s=0.41. These results have been obtained at £29.00 with the 
initial specification of Т at t= 2.00. On a coarse grid the various schemes demon- 
Strate comparable accuracy. However, on a fine grid the implicit schemes are 
considerably more accurate. 

The generally high levels of accuracy achieved with fine grids are partly a 
reflection of the smoothness of the exact solution and the relative simplicity of the 
governing equation. As noted in Sect. 7.1, such high levels of accuracy should not 
be expected when dealing with fluid dynamic problems. However, the selection of 
the coefficients in the discretised equation, as in (7.26), to reduce the truncation 
error is a valid technique, as long as the stability of the algorithm is sufficient. 
Generally the improved stability behaviour of implicit schemes, compared with 
explicit schemes, permits more flexibility in the choice of the free parameters, i.e. y 
and f in (7.26). 


7.3 Boundary and Initial Conditions 


In Sects. 7.1 and 7.2 Dirichlet boundary conditions are used and where two levels of 
data are necessary as initial conditions, these are provided by the exact solution. 
Consequently the only source of error in the solutions arises from the discretisation 
of the derivatives in the governing equations (7.1). In this section boundary and 
initial conditions will be considered in situations where additional errors are 
introduced in implementing the boundary and initial conditions. 


7.3.1 Neumann Boundary Conditions 


The algorithms developed so far, for example (7.6), are appropriate to internal 
nodes. To use these formulae for boundary values, іс. Т^“! in Fig. 79, would 
require knowledge of the solution outside the computational domain. Therefore 
special formulae must be developed at the boundaries. For Dirichlet boundary 
conditions (7.2) there is no difficulty. Where a value for 77 is required the 
substitution 77 = Б" is made, from (7.2). 
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Tax = c(tn) (ӘТ/ әкі = celtne) 
(T2 - T5)/2Ax > (Tol - 17 *1)/2Ах 
= с^ = nel 
nel nel om — 
]=0 1 2 
n o= n 
1:0 1 2 
x20 x20 


Fig. 7.9. Treatment of Neumann 
Explicit treatment Implicit treatment boundary conditions 


Neumann boundary conditions (7.3) pose a greater problem. A one-sided finite 
difference expression, using only information inside the domain, can be introduced 
for [0T/0x], in (7.3). The result is 


T^*! — Ti! 
| 4x 7 


In a typical application the FTCS scheme (7.6) would be used to obtain the value of 
T**! at all interior nodes, j=2,..., J—1. At the boundary, x —0, (7.32) gives 


ci. (7.32) 


ТИ: = Та +! Ax. (1.33) 


The major problem here is that (7.33) has а truncation error of O(4x) whereas the 
FTCS scheme has a truncation error of O(Ax?). Since the diffusion equation is а 
parabolic equation (Sect. 2.3) the lower accuracy of the solution at the boundary 
will affect the accuracy of the solution in the interior for all later time. 

Therefore it is desirable to represent the Neumann condition (7.3) with an 
algebraic expression that has the same truncation error as the expression used for 
the interior nodes. This can be achieved in the following way. Equation (7.3) is 
represented by 


T2-To_ 
2Ax — 

To achieve a truncation error of O(Ax?) (7.34) includes the fictitious node (0, п) 
which lies outside the computational domain (Fig. 7.9). However, if the com- 
putational domain is extended, notionally, to include this point, then (7.34) can be 


combined with an interior equation, for example (7.5) centred at (1, п), to eliminate 
T^. The result is 


T%*1 = —2sAxc"4(1—2s)T"+25T% , (7.35) 





с". (7.34) 


and the truncation error is of O(At, Ах?) everywhere іп the domain. If an implicit 
scheme is being used in the interior, for example (7.19), (7.34) is evaluated at time- 
level n+ 1 and combined with (7.20) to give (Fig. 7.9) 


(1+2s)T%*!—2s Т"+1 - T1 —2sAxc"*! , (7.36) 
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which becomes the first equation in the tridiagonal system equivalent to (7.21). This 
system can be solved by the Thomas algorithm. The construction used to create 
(7.35 and 36) is also applicable if Neumann boundary conditions are required at 
x=1 (Fig. 7.1). 

It is expected that the use of different formulae at the boundaries may alter the 
stability properties. Strictly the relatively straightforward von Neumann stability 
analysis (Sect. 4.3) is only applicable to internal equations; although Trapp and 
Ramshaw (1976) suggest that the von Neumann analysis can be applied, heu- 
ristically, at boundaries. Alternatively the matrix method may be used to determine 
the stability of the complete system of equations, including those formed at the 
boundaries. 

For the diffusion equation Mitchell and Griffiths (1980, pp. 47-53) discuss the 
application of the matrix method to a Crank-Nicolson formulation with various 
boundary conditions. They show that with Dirichlet boundary conditions the 
Crank-Nicolson scheme is unconditionally stable. With Neumann boundary con- 
ditions the Crank-Nicolson scheme is stable but one eigenvalue of the ampli- 
fication matrix is unity which produces an oscillatory solution. For the general 
mixed boundary condition an additional restriction is necessary on the parameters 
in the boundary condition to ensure stability. 


7.3.2 Accuracy of Neumann Boundary Condition Implementation 


The implementation of Neumann boundary conditions (Sect. 7.3.1) with the dif- 
fusion equation will be combined with some of the schemes described in Sects. 7.1 
and 7.2 and the impact on the overall accuracy will be indicated. Here solutions to 
the diffusion equation 


T 27 
aT UT 6 


a.” 9х2 (7.37) 


will be obtained іп the spatial interval 0.1 £x < 1.0, with boundary conditions 
ax. 


2 
ôT с= 2 – 2л sin (0.05л)ехр |- « (5) 1 at х=0.1 (7.38) 


and T=2 at x=1.0. 
Initial conditions are chosen to be 


Т--2х--4сов(0.5лх) at t=0. (7.39) 
This problem has the exact solution 


2 
T — 2x + 4cos(0.5nx)exp |- a (5) 1 ; (7.40) 


which will be used to assess the accuracy of the computational solution. 
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Numerical solutions of the above problem have been obtained using a modified 
form of program DIFEX (Fig. 7.4). The rms errors for different size grids are 
shown in Table 7.8. The rms errors were computed at t=9.00. To avoid errors in 
implementing the initial conditions the exact solution (7.40) has supplied the initial 
conditions at t=0.80. All solutions shown in Table 7.8 have been obtained with 
$0.30. 


Table 7.8. Explicit interior schemes with Neumann boundary condition at x=0.1 


Boundary rms error Approximate 
Interior condition ——————————————— convergence 
method formula 4x —0.225 Ax —0.1125 4x —0.05625 rate, r 
FTCS (7.32) 0.1958 0.07978 0.03539 1.2 
FTCS (7.34) 01753x107?  0.4235x10^? 01064х1073 20 
3L-4TH, у-0 (7.34) 0.4244х10:2 0.9142х10-3 0.2144х10-3 21 
3L-4TH, y=1 (7.34) 0.8684 х 10-2 02034х10:2 0.4867х10-3 24 


The combination of boundary condition implementation (7.32) and the FTCS 
scheme is seen to produce a relatively inaccurate solution which increases in 
accuracy roughly linearly with grid refinement. Thus the first-order spatial accu- 
racy associated with (7.32) is over-riding the second-order spatial accuracy associ- 
ated with the FTCS scheme. This is confirmed by the error distribution for the 
Ax =0.225 grid shown in Table 7.9. The solution is least accurate at x =0.1 where 
the Neumann boundary condition is applied and becomes progressively more 
accurate as x = 1.00 is approached. 


Table 7.9. Error distribution for 4x 20.225 in Table 7.8 


х= 01 0.325 0.550 0,775 1.000 
FTCS +(7.32) —0.3799 —0.1989 — 0.08366 —0.02610 0.000 
FTCS (7.34) 0.332 х 10-3 -0221х10-2 —0.272х10-2 —0.173х10-2 0.000 
3L-4TH, 0832x107? — 0421x107? 0166х10-2 0.572х10-3 0.000 
y=0+(7.34) 


The second-order first derivative specification (7.34) can be combined with the 
FTCS scheme to produce the algorithm (7.35) at x 20.1. The rms errors for this 
combination are shown in Table 7.8. Clearly more accurate solutions are being 
obtained than with the first-order Neumann boundary condition specification 
(7.32. Equation (7.34) has a second-order truncation error as has the FTCS 
scheme. The combination is demonstrating, Table 7.8, second-order convergence. 
An examination of the error distribution, Table 7.9, for this combination indicates 
that the error at x 20.1, where the Neumann boundary condition is applied, is 
smaller than the error elsewhere. 
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The second-order boundary condition specification (7.34) can be combined 
with the fourth-order internal scheme (7.16). This is done most easily by using (7.34) 
to obtain Т" explicitly, and then applying (7.16) centred at node 1. The rms errors 
for various degrees of grid refinement are shown in Table 7.8 for two values of y. 
Generally, increasing y reduces the accuracy. The second-order truncation error of 
the boundary condition specification is sufficient to reduce the global convergence 
rate to second order. In fact the accuracy for both values of y is less than the 
accuracy of the FTCS scheme. A consideration of the error distribution, Table 7.9, 
indicates that the error produced by the 3L-4TH scheme is significantly larger close 
to x 20.1, where the derivative boundary condition is applied, but is smaller close 
to х= 1.00, than the error produced by the FTCS scheme. 

In principle a higher-order representation for 0T/Ox than (7.34) could be 
introduced that involved additional nodal values, Т”, etc. However, besides рго- 
ducing a more complicated algorithm it would be necessary to establish the 
stability restriction on s. Strictly this should be undertaken with the matrix method 
(Sect. 4.3). Often introducing higher-order boundary condition formulae reduces 
the working range of s associated with the interior algorithm. 

The first- and second-order boundary condition specifications (7.32, 34) have 
also been combined with some of the implicit schemes described in Sect. 7.2, to 
produce the solution errors shown in Table 7.10. The rms errors are computed at 
t=15.00. The exact solution (7.40) has been used to provide the initial solution at 
t — 5.20. The solutions shown in Table 7.10 have been obtained at s— 1.0. The two 
interior methods, FDM-2ND and FDM-4TH, correspond to МЕ-і and 3, 
respectively, in Table 7.4. 


Table 7.10. Implicit interior schemes with Neumann boundary condition at x=0.1 





Boundary rms error Approx. 
Interior condition | ——————————————————————— convergence 
method formulae  4x-0.2225  4x-0.1125 4х= 0.05625 rate, r 
FDM-2ND (МЕ =1) (7.32) 0.1448 0.07138 0.03344 11 
FDM-2ND (ME = 1} (7.34) 0.01478 0.00363 0.00087 24 
FDM-4TH, у-0 (7.34) 0.00912 0.00223 0.00053 2.1 
FDM-4TH, у-1.0 (7.34) 0.01478 0.00539 0.00141: 19 





The combination of the first-order boundary condition specification (7.32) and 
the second-order FDM scheme produces an overall first-order convergence rate 
and a solution accuracy that is comparable to that produced by the corresponding 
explicit scheme, Table 7.8. This comparable accuracy is in spite of the larger s used 
to produce the implicit results. This provides indirect confirmation that the first- 
order boundary condition specification is dominating the rms error evaluation. 

A significant reduction in error is achieved by utilising the second-order 
boundary condition specification (7.34) particularly on the finest grid. The accu- 
гасу is not as great as produced in combination with the FTCS scheme, although 
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the convergence rate is second order. Switching to a fourth-order interior scheme, 
FDM-4TH, produces a slightly more accurate solution but with only a second- 
order convergence rate. For y=1.0 the accuracy is less than for y 20, which is 
consistent with the behaviour of the explicit scheme. 

To achieve a higher-order convergence rate it would be necessary to introduce a 
more accurate finite difference representation for the Neumann boundary con- 
dition at x 20.10. Since this would couple, typically, four or five grid points it 
would necessitate a modification to the Thomas algorithm for solving the matrix of 
discretised equations. 


7.33 Initial Conditions 


Initial conditions, of the form (7.4) do not cause any difficulty for a two-level 
scheme, like (7.6 or 20), except at the boundary if a Dirichlet boundary condition is 
enforced. It can happen that, at a boundary х=0 say, the value of Т(0,0) specified 
by the initial condition is different from the value of T(0,0) specified by the 
boundary condition. The preferred strategy is to average the values of T(0, 0) for 
the first time-level, n=0, but to revert to the proper boundary condition for 
subsequent time. In Table 3.7 this strategy is compared with the alternative of 
specifying Т(0, 0) by the initial conditions for the first time level, п=0. For this 
particular example the averaging strategy is seen to produce a more accurate 
solution. 

Three-level schemes, like (7.25), require two levels of known data; therefore they 
must be supplemented at the start of the integration. This should be done with a 
two-level scheme that achieves the same or better accuracy. Thus a second-order 
two-level scheme like the Crank-Nicolson method is appropriate. Alternatively a 
first-order temporal scheme like the FTCS scheme can be used as long as the 
second level of initial data is computed on a fine grid to compensate for the first- 
order accuracy. Richardson extrapolation (Sect. 4.4.1) is useful for this purpose. 


7.4 Method of Lines 


Algorithms developed in Sects. 7.1 and 7.2 introduce discretisation formulae for 
both the time derivative and the spatial derivative terms in (7.1) simultaneously. 
This often permits cancellation of terms in the respective truncation error ex- 
pressions so that higher-order accuracy is achieved. 

However, an alternative philosophy is to discretise the spatial term first 
reducing the partial differential equation to a system of ordinary differential 
equations for the nodal values. Thus (7.1) becomes, notíng the change in dependent 
variable from T to u, 


du, a(uj-i —2u;4 Н+ 1) = 


di Td 0, (7.41) 
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where a second-order three-point finite difference expression has been used to 
discretise д2й/дх? іп (7.1). The process of reducing (7.1) to (7.41) is an example of the 
method of lines (Holt 1984) or semi-discretisation. 

An attractive feature of the method of lines is that the various techniques for 
solving systems of ordinary differential equations (Gear 1971; Lambert 1973; 
Seward et al. 1984) are available to solve the semi-discrete form of the original 
partial differential equation. However, it should be emphasised that the con- 
struction of the semi-discrete form introduces an error associated with the spatial 
discretisation. Consequently the “best” choice for solving the resulting system will 
usually be an algorithm of lower order than if the system of ordinary differential 
equations contains no approximation. The more effective methods for solving 
initial value problems governed by ordinary differential equations can usually be 
categorised as either linear multistep methods or Runge-Kutta methods. 

The repeated application of (7.41) at all internal nodes produces а system that 
can be written | 


и ди. (7.42) 


If the spatial term in (7.1) had been nonlinear (7.42) would be written more 
generally as 


AF. (7.43) 


du 

à and (7.44) 
du 

= (7.45) 
А general linear multistep method applied to (7.45) can be written 

У au" t= ДЕ У pif" А (7.46) 
і= 0 і=0 


where the solution и"*” is being sought. If В„=0, this solution is obtained 
explicitly; if „550, (7.46) is an implicit algorithm for и"*". 
The simplest case, m=1, gives 


au" *! angu" — At(Bo f" B,fn*) , (7.47) 
which includes the Euler scheme (x, = — ж=1, Во=1, В, =0), 


и" 1-и dtf". (7.48) 
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Applied to the linear system (7.42), in the form of (7.41), this coincides with the 
FTCS scheme (7.5). Equation (7.47) also includes the trapezoidal scheme 


м1 u" = 4100.57" +0.57"* 1). (7.49) 


Applied to (7.41) the trapezoidal scheme coincides with the Crank-Nicolson 
scheme (7.22). 
The two-step interpretation of (7.46) is 


au * 2c a w * o aou" — Atl hof" +B "+В, 7" 2). (7.50) 


When applied to (7.41) with f; =0, (7.50) provides the solution 15 2, explicitly, and 
is identical with (7.12). If 8,40, (7.50) applied to (7.41) includes both (7.25) and 
(7.26). For example, the 3LFI scheme (Sect. 7.2.3) corresponds to the choice 
и =1.5, 4, = —2.0, 4420.5, By = В, =0, В, = 1.0. In fact all of the schemes devel- 
oped in Sects. 7.1 and 7.2 can be interpreted as linear multistep methods. 

For the solution of ordinary differential equations it is customary to use linear 
multistep methods of considerably higher order (involving more steps) and in a 
predictor-corrector mode. An explicit scheme provides the predictor and an 
implicit scheme the corresponding corrector (Gear 1971, p. 136). However, if a 
high-order strategy is applied to the system of equations arising from the semi- 
discretisation of a partial differential equation, e.g. (7.41), a large storage area is 
required to store the u"*! and Е" +! vectors. In addition the use of а high-order 
scheme to march in time would be inefficient since the solution errors would 
probably be dominated by the low-order discretisation of the spatial terms, unless 
the exact solution is very smooth. - 

The need to avoid the storage of a large number of solutions at previous time 
steps motivates the use of one-step methods, such as Runge-Kutta schemes to solve 
semi-discrete systems. The general R-stage Runge-Kutta scheme applied to (7.45) 
can be written 


R 
и" У of", (7.51а) 
гаж 1 
with 
R 
f'=fit+ ata, y+ dt È baf’), r=1,2,...,R, (7.51b) 
ші 
апа 
К 
a=) b, r=1,2,...,R. (7.51c) 
зж] 


Equation (7.51) includes implicit Runge-Kutta schemes which are computationally 
expensive since (7.51b) is a nonlinear equation which must be solved iteratively for 
each of the /” at each time-step. Consequently there is more interest in explicit 
Runge-Kutta schemes for which the upper summation limits in (7.51b, c) are 


244 7, One-Dimensional Diffusion Equation 


replaced by ғ- І. Thus the Euler scheme (7.48) can be interpreted as a single-stage 
(R= 1) explicit Runge-Kutta scheme with c, =1, Б 20, a, =0. 

There is an infinite number of second-order two-stage explicit Runge-Kutta 
schemes. A typical one is the improved Euler scheme 


u*=u"+ Atf” and (7.52) 
шіри +00.5 At(f"+f*). 


Many third-order three-stage explicit Runge-Kutta schemes are available 
(Lambert 1973). The following fourth-order, four-stage explicit Runge-Kutta scheme 
has been widely used for solving initial value problems governed by ordinary 
differential equations: 


зі swi (f'42f*--2/**.f/***) ,— where (7.53) 


u* =u'4+0.5Atf",  f* -f(" и") 

u** =1"+0.5 317+, /** =f(t"* 172, u**) 

u*** = yj" + Atf** , f***zf(t", u***) . 
Clearly higher-order Runge-Kutta schemes require more evaluations of f. When 
solving semi-discrete systems, e.g. (7.43), the repeated evaluation of all the terms 
associated with the spatial discretisation often dominates the overall execution time. 


If the scalar equation (7.44) is considered along with the initial condition, 
u(0) = us, the corresponding exact solution has the form 


u(t) ug e^t Fu, . (7.54) 


For the physical solution to correspond to a stable process for the interval 
0<1< оо, it is necessary that 4x 0. Absolute stability of a linear multistep method 
(7.46) is analysed by applying it to (7.44) and constructing the stability polynomial 


Ms 


(n; -AAtB,)r'=0 . (7.55) 


i20 


Absolute stability is obtained for a given АЛ (Lambert 1973, p. 66) if all the roots, r,, 
of (7.55) satisfy 


Inl«1. (7.56) 


This is essentially equivalent to the restriction required for stability of a discretised 
partial differential equation using the von Neumann stability analysis (Sect. 4.3). 

For the Euler scheme, absolute stability is achieved if —2x A4t <0. Since 4 «0, 
this corresponds to At «2/|A|. For the trapezoidal scheme absolute stability implies 
АДЕ «0; thus the trapezoidal scheme is stable for any choice of At. 

An equivalent criterion of absolute stability to (7.56) is provided by Lambert 
(1973, p. 136) and leads to the limit —2<AAt<0 Юг both the one-stage (Euler 
Scheme) and two-stage explicit Runge-Kutta schemes (7.52). However, the four- 


74 Method of Lines 245 


stage explicit Runge-Kutta scheme (7.53) has the slightly enlarged stability interval 
— 2.78 <AAts0. 

In the method of lines, systems of ordinary differential equations occur. Thus it is 
appropriate to test the absolute stability of equations of the form (7.42). The stability 
polynomial (7.55) can still be derived if 4 is replaced by 4,, the eigenvalues of A. In 
general A, will be complex. However, for a physically stable problem it will be 
necessary that the real part of 4,, КІА,, is non-positive. 

As an example, the application of the Euler scheme (7.48) to (7.41) produces the 
following criterion for absolute stability: 


~2<RIA,Ats0 . (7.57) 


For a system of M equations based on (7.41), A is tridiagonal with eigenvalues 
given by (9.48), ie. 


_a{-2+ 2cosf(kn)/(M + 1)} 








À 7.58 
k Ax (7.58) 
Consequently the severest restriction on At coming from (7.56) is 
4aAt 0.5 Ax? 
—2<- or ді 0245 (7.59) 
Ах a 


Thus the same stability restriction on At is obtained as for the FTCS scheme (7.5). 
This is to be expected since the Euler time-stepping method applied to (7.41) co- 
incides with the FTCS scheme. Application of the trapezoidal rule (7.49) to (7.41) 
leads to the restriction RIA4,4t£ 0. But since RIA, S0 from (7.58) there is no 
restriction on At. The trapezoidal rule applied to (7.41) coincides with the Crank- 
Nicolson scheme (7.22). As these examples suggest, the absolute stability of the 
algorithm solving a system of ordinary differential equations is equivalent to the 
stability of the algorithm solving the related discretised partial differential equation 
(Sect. 4.3). 

If the system obtained from applying the method of lines is nonlinear, e.g. (7.43), 
the determination of the stability of the algorithm depends on the eigenvalues of the 
Jacobian 205/20. However, now the eigenvalues depend on the solution and may 
need to be monitored while the solution is being obtained. 

The exact solution of the system of equations (7.42) formed from (7.41) can be 
written as 


U=} mee -U, , (7.60) 
k=1 


where the eigenvalues 4, are given by (7.58) and e* are the corresponding eigen- 
vectors. The coefficients о, are determined by the initial conditions (х;). From 
(7.58) it is clear that the eigenvalues cover a considerable range, particularly on a 
refined grid. This implies that the contributions to the solution of different eigenvec- 
tors in (7.60) will decay at different rates. 
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If an explicit scheme is used to solve (7.42) the restriction on the time-step will be 
determined by |RIA,|,a.. However, the time over which the integration will be 
required will be determined by |RI4,|,;,. The ratio [Rlylmax/|RlAxlmin is called the 
stiffness ratio. If this ratio is very large, perhaps 10*-10°, the system of ordinary 
differential equations is said to be stiff. Clearly algorithms that have no restriction on 
the time-step are desirable for solving stiff systems. 

Such unconditional stability is guaranteed if the numerical method is A-stable 
(Dahlquist 1963), i.e. its region of absolute stability contains the whole of the left- 
hand half plane Ridt «0. But A-stability is rather demanding. No explicit linear 
multistep or explicit Runge-Kutta scheme is A-stable. A second-order A-stable 
implicit linear multistep method is the highest order possible. 

A related, but less severe, stability concept is A(«) stability (Widlund 1967), which 
is illustrated in Fig. 7.10. A method is A(a) stable if an infinite wedge, symmetric 
about the negative real axis and of half-angle a, can be constructed within which the 
method is absolutely stable. If all the eigenvalues of А are real, as is the case for (7.41), 
a numerical method that is A(0) stable will be effective even if the system is stilf. 





Fig. 7.10. A(o) stability 


To obtain the solution of steady flow problems the numerical algorithm is often 
based on an equivalent pseudo-transient construction (Sect. 6.4). To achieve the 
steady-state solution as efficiently as possible, implicit schemes are constructed to 
exploit the known stability behaviour of equivalent algorithms for ordinary differen- 
tial equations (Beam and Warming 1979). Such implicit algorithms (Chaps 8-10, 14, 
17 and 18) are usually equivalent to linear multistep methods of low order. 

The matching of computational algorithms to the characteristics of vector and 
parallel processors (Ortega and Voigt 1985; Gentzsch and Neves 1988) has renewed 
interest in explicit methods, even for obtaining steady-state solutions via a pseudo- 
transient formulation. These are often based on Runge-Kutta time-marching 
schemes in conjunction with a method-of-lines construction (Chaps. 14 and 18). 


7.5 Closure 


Explicit schemes, up to fourth-order, have been constructed for the one-dimensional 
diffusion equation. Higher-order accuracy has been achieved by balancing the 
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truncation errors associated with discretising the spatial term 02 Т/дх? at different 
time levels n and the time derivative 27/01 at different spatial locations j. In practice 
this requires choosing certain free parameters, as in (7.15), so that low- 
order terms in the truncation error are identically zero. 

Explicit schemes usually impose a maximum value on At (through the s par- 
ameter) for stable solutions. Higher-order explicit schemes typically have more severe 
stability restrictions on At. For the DuFort-Frankel scheme it is necessary to limit At 
to obtain accurate solutions even though stable solutions can be obtained with very 
large values of At. ‘ 

Implicit schemes often achieve unconditional stability at the price of additional 
computational effort. As long as the implicit scheme can be restricted to tridiagonal 
form the total computational effort is about double that of solving an equivalent 
explicit scheme. The inclusion of additional terms at the implicit (n+ 1) time level 
allows more flexibility in constructing higher-order schemes. However, the results 
(Table 7.6) suggest that a fine grid may be necessary to obtain a significant improve- 
ment in accuracy. 

If the problem governed by the diffusion equation requires a derivative boundary 
condition specification, additional errors arise in discretising the boundary con- 
dition. For problems governed by a parabolic partial differential equation it is 
important that derivative boundary conditions be discretised with formulae of the 
same order of accuracy as those used in the interior. 

By discretising the spatial term only, the governing partial differential equation 
(PDE) is converted to the semi-discrete form, consisting of a system of ordinary 
differential equations (ODEs) in time. Consequently various algorithms developed 
specifically for ODEs become available for solving the PDE. However, it is empha- 
sised that the equivalent system of ODEs contains errors arising from the spatial 
discretisation. This mitigates against the use of very high-order ODE solvers. The use 
of the semi-discrete form is convenient when a spectral method is employed to 
discretise the spatial terms (Sects. 5.6, 15.3.3 and 17.1.6). 


7.6 Problems 


Explicit Methods (Sect. 7.1) 


71 Derive а) (7.14), b) (7.17). 

7.2 Carry out an approximate operation count for the FTCS, DuFort-Frankel and 
3L-4TH (у--0) schemes at 5= 0.30 and compare the computational efficiency. 

7.3 Construct a two-level five-point scheme that is fourth-order accurate. Asym- 
metric five-point formulae will be necessary adjacent to the boundaries. Test the 
stability of the scheme using the von Neumann analysis. Obtain numerical 
results to confirm the stability restrictions and convergence rate if possible. 

7.4 Consider (7.13) with negative values of y to see if stable and accurate solutions 
can be obtained. For y2 —0.5 can (7.13) be modified to look more like a 
DuFort-Frankel scheme, i.e. by changing the В structure of the spatial deriva- 
tive discretisation? If so, determine whether a stable, higher-accuracy “DuFort- 
Frankel" scheme, can be constructed. 
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Implicit Methods (Sect. 7.2) 


75 
7.6 


7.7 


7.8 


7.9 


Derive (7.30). 

Apply a von Neumann stability analysis to (7.31), with В given by (7.30), to 
determine the y, s envelope for stable operation for the cases: a) 6=0, b) 6 
= 1/12, с) 6=1/6. 

For д = 1/6, y=0, B=1 and s=1/6, (7.31) becomes explicit. Compare solutions 
for this case with those produced by the FTCS scheme. Explore other par- 
ameter combinations that make (7.31) explicit. 

Use program DIFIM to obtain solutions for cases ME=4, 5 with y —0.5. 
Compare the accuracy and convergence rate with the data shown in Table 7.6. 
Obtain an approximate operation count for the cases МЕ = 1,4 апа y=0 in 
Table 7.6 and compare the computational efficiency with 5 = 1.0 with that of the 
FTCS scheme at s=0.5 and the DuFort-Frankel scheme at 5 = 0.3. 


Boundary and Initial Conditions (Sect. 7.3) 


7.10 Modify program DIFEX to implement: 


741 


742 


a) first-order accurate Neumann boundary condition (7.32), 

b) second-order accurate Neumann boundary condition (7.34), and reproduce 
the results given in Table 7.8. 

Programs DIFEX and EXTRA will need to be modified to suit the problem 
defined by Eqs. (7.37) to (7.40) 

Modify program DIFIM to implement: 

a) first-order accurate Neumann boundary condition (7.32), 

b) second-order accurate Neumann boundary condition (7.34), and obtain the 
results indicated in Table 7.10. For JMAX =6 compare the error distribution 
with that shown in Table 7.9. 

Programs DIFIM and EXTRA will need to be modified to suit the problem 
defined by Eqs. (7.37) to (7.40). 

For the DuFort-Frankel scheme starting from the exact solution (subroutine 
EXTRA) at t 22.00, introduce the FTCS scheme on a finer x and t grid to 
obtain the second level of starting data. For s=0.3 and s=0.5 compare the 
accuracy of the FTCS-started DuFort-Frankel solution at t=9.00 with that 
obtained by starting from two levels of data given by the exact solution at 
t— 2.00 and 2.00 — At. 


Method of Lines (Sect. 7.4) 


7.13 


7.14 


Modify program DIFEX to apply the four-stage Runge-Kutta method апа 
obtain results to compare with Table 7.3, for s = 0.41. 
For the scheme developed in Problem 7.13 carry out: 

i) a Taylor series expansion to determine the truncation error leading term 
ii) a von Neumann stability analysis to determine any restriction on s 
iil) an operation count to compare the computational efficiency with that of 
the FTCS scheme. 


8. Multidimensional Diffusion Equation 


A broad conclusion from Chap. 7 is that implicit schemes are more effective than 
explicit schemes for problems with significant dissipation, as exemplified by the 
one-dimensional diffusion equation. 

In extending implicit schemes to multidimensions, special procedures are 
necessary if economical algorithms are to be obtained. The special procedures are 
often built around some means of splitting the equation on a convenient coordinate 
basis (Sects. 8.2, 8.3 and 8.5), The use of splitting constructions also requires careful 
attention being given to the implementation of derivative (Neumann) boundary 
conditions (Sect. 8.4). The splitting techniques developed in this chapter are ap- 
plicable to finite difference, finite element and finite volume methods. 


8.1 Two-Dimensional Diffusion Equation 


In two dimensions, the diffusion equation is written 
дТ Т eT 
— — x —— — Ol — ХІ 
ді дх? ? ay? 
For the region shown in Fig. 8.1, Dirichlet boundary conditions are 


T(0,y,t)=a(y,t) , 
T(Ly,t)- b(y.t) , | (82) 
T(x,0,t)- c(x,t) , 
T(x, 1,t)=d(x,t) , 


T(x,1,t) = d(x,t) 


y= 1 К= МУ 
- -”,А — - 
T10,.!) х kel Тул) 
2 a(y.t) Ay . = b{y,t) 
k-1 


jal ja j ін i*NX Fig. 8.1. Two-dimensional do- 
SO L1 main and Dirichlet boundary 


х:0 Ті х,0,1) (x,t) xz] conditions 
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and initial conditions 
T (x, у,0) = To(x. у). (8.3) 


In this section, typical explicit and implicit schemes for the one-dimensional 
diffusion equation are extended to two dimensions to see if they are directly 
applicable. 


8.1.1 Explicit Methods 


The FTCS scheme, in two dimensions, is 


ату А , 
9; bee Тра Ly T1, 70 where (8.4) 
LaTi o Tat Te ‚ etc. 


Ax? 
As an algorithm this becomes 
Tik =8,T5-1,4+ (1-25, —2s,) Т E STI 1 
+5 Гука HST hk+ > (8.5) 


where s, =a, At/Ax? and s,—a, At/Ay?. A Taylor series expansion about the (j, К, п) 
node indicates that (8.5) is consistent (Sect. 4.2) with (8.1) and has a truncation error 
of O(At, 4х2, Ay’). 

А von Neumann stability analysis demonstrates that (8.5) will be stable if 


s,+5,50.5 . (8.6) 


It may be noted that if s, — s, = s, (8.6) gives 550.25, which is more restrictive than 
the corresponding expression in one dimension (Sect. 7.1.1). However, if small time- 
steps must be used to obtain a sufficiently accurate solution, the restrictive stability 
condition may not be critical. 

For the case a —a, =a, and Ах = Ay, Mitchell and Griffiths (1980, p. 55) provide 
an extension to (8.4), i.e. 

4 Ті i n n 2 n 

Gp tlie T5, 7 aL, Thy 8 M LaL T1470 , (8.7) 
which is stable in the range 0 «sx 0.5. Although (8.7) is a nine-point scheme it can 
be implemented economically in two stages 


Tf,—-(l-t«AtL,)T?;, and (8.8) 
Ti =(1+adeL,.) TE, . (8.9) 
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This scheme extends to a three-stage algorithm in three dimensions while maintain- 
ing the “one-dimensional” stability range 0<s<0.5. By contrast, the three- 
dimensional FTCS scheme with Ax = Лу = Az is stable in the range 0<s 51/6. 
One of the more interesting explicit algorithms is the hopscotch method 
(Gourlay 1970). This method can be interpreted, in its simplest form, as a two-stage 
ЕТС scheme. In the first stage, (8.5) is applied at all grid points for which j -- k -- n 
is even; that is, on a grid pattern corresponding to the black squares on a chess- 
board. In the second-stage the following equation is solved at all grid points for 
which j -- k -- is odd, i.e. on the red squares of an "equivalent" chess-board. 


(1--25,-25) Tii — TA ts (Т К + (T34, +T). (810) 


The terms evaluated at t,, , on the right-hand side of (8.10) are known from the first 
stage. The simple hopscotch method has a truncation error of O(At, Ax?, Ay?) but, 
in contrast to the FTCS scheme, is unconditionally stable. Mitchell and Griffiths 
(1980, p. 77) discuss the hopscotch family of methods in more detail. 


8.1.2 Implicit Method 


Following the same procedure as for one-dimensional problems, it is possible to 
obtain an implicit scheme by evaluating the spatial derivatives in (8.1) at the time- 
level (n+ 1). The resulting algorithm is 


-s Tit}, +(14+2s,+2s,) T1! 5, Tnt, 


ntl 
=s қаны 1—Sy4 jik+1 


Г 
= 


ғ (8.11) 

This scheme has a truncation error of O(At, Ax?, Ду?) and is unconditionally 
stable. However, the difficulty here is in obtaining, economically, the solution of the 
equations that result from applying (8.11) at every node. 

For the present equation, (8.11), it is possible to number the nodes so that three 
terms are on or adjacent to the main diagonal but the other two terms are 
displaced, effectively, by the number of internal nodes across the grid (say, NX —2 
in Fig. 8.1). Consequently, the Thomas algorithm cannot be used. Using con- 
ventional Gauss elimination (Sect. 6.2.1) would be prohibitively expensive; using 
sparse Gauss elimination would still be unacceptably uneconomical for refined 
grids. 


8.2 Multidimensional Splitting Methods 


The problem with the two-dimensional implicit scheme can be overcome by 
splitting the solution algorithm (system of algebraic equations) into two half-steps 
to advance one time-step. At each half-step, only terms associated with a particular 


— MM 


«+ басп патер, опу terms | 
coordinate direction are treated implicitly. Consequently, only three implicit terms 
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appear and these can be grouped adjacent to the main diagonal. As a result, the 
very efficient Thomas algorithm can be used to obtain the solution. The overall 
process of treating each time-step as a Sequence of simpler substeps is referred to as 
(time-)splitting. 


8.2.1 ADI Method 


The best known example of a splitting technique is the alternating direction 
implicit (À (ADI) method (Peaceman and Rachford. 1955). We will examine the ADI 
method in detail and then introduce a generalisation of the splitting approach. 

The ADI interpretation of (8.1) is written in two half time-steps as follows. 
During the first half-step the following discretisation is used: 


Tf. Та n 
AB Ee The —a,L,, T? ,70 , (8.12) 
and during the second 
T'*1—-T* n / 
т PR Deg T tp a, Lp, Tt} =0. (8.13) 


During the first half-step the solution Т is known at time-level n but is unknown 
at the (n+ 1/2) time-level, denoted by *. However, unknown nodal values 7% are 
associated with the x-direction only (i.e. constant value of k in Fig. 8.2). Equation 
(8.12) can be rewritten as one member of a system of equations as 


-055,Т7Ғ1,5%(1-5,)7%,-0.55,7%1, 
-0.55,7", 1-%(1-5,)Т,-0.5,7",.1. | (8.14) 


Other members of the system аге formed around the other nodes in the same row К. 
Thus, the solution of the system of equations gives the intermediate solution 7%, 


j=2,..., NX — 1, for one ES ofk only. Sequentially, systems of equations are 
solve d io for Tt pj223,. X —1, for each row, К--2,..., NY—1, using the 


Thomas algorithm. 


k+] Sweep i in the 
-direction 
k (constont k) 
during the 
k-1 first half 
time-step 
ум 
bM 


Sweep in the y-direction (constant |) 
during the second half time-step Fig. 8.2. ADI implementation 
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During the second half-step (8.13) is used, but in the form 


—0.5s, T^1 1 tu sy) Ti — — 0.55,7% Hem 
=0.55. T$ , t(l- sx) ТҰ, +0.55. T he 1 . (8.15) 


During the second half-step the solution at time-level n+1 is unknown but the 
solution at the intermediate time-level * is known. A system of equations associated 
with all the n nodes along one rid line in the y-direction {j fixed) i 15 solve solved for ТҮ ұз 
k=2,..., NY-1. The process is repeated for each grid line, j= 2... “МІ, in 
turn. 

The stability of the ADI scheme can be ascertained by applying the von 
Neumann stability analysis to obtain the amplification factor for each half-step. 


The stability of the complete time-step is determined by the product of the two half- 


step amplification factors, 1.е. 
аи P RED налы ut t p UP Rt Rr 


1 — 2s, sin?(0,/2) ][ 1 — 25, sin?(0,/2) 
= =. 8.16 
б= 66 | + 2s, sin?(6,/2) || 1+ 2s, sin?(0,/2) (8.16) 


An examination of (8.16) indicates that |6|<1 Юг any value of s,, s,, 9..0). 
However, a consideration of [G'| and |С"| indicates that whereas the full step is 
unconditionally stable, each half-step is only conditionally stable. But only the full 
step is of interest. 

The composite scheme (8.12 and 13) is consistent with (8.1) and has a truncation 
error of O(At?, Ax?, Ay”). The second-order time accuracy follows from the 
symmetry of the scheme, just as the second-order accuracy of the Crank-Nicolson 
scheme followed from the symmetry about the time-level (п + 1/2). 

However, to achieve a global truncation error of O(At?), it is necessary to 
introduce boundary values for the intermediate solution Tf, that are compatible 
with the interior algorithms (8.14 and 15). For example, if Dirichlet boundary con- 
ditions are imposed, the evaluation of T£x,,—b;*!7? at x—1 produces an al- 
gorithm that has a truncation error of O(4t). To produce a truncation error of 
O(At?) it is necessary to evaluate 7%,, from (8.12 and 13) as 


T fx , =0.5(b2 + 6+1) —0.254t L, (bz * ! Б"). (8.17) 


A similar problem arises with approximate factorisation schemes; appropriate 
boundary condition specifications are discussed in Sects. 8.3.2 and 8.4. 

Thus, it may be concluded that the ADI scheme, in two dimensions, has the 
desirable attributes of being unconditionally stable, second-order accurate and 
economical to solve. The ADI scheme extends to the three-dimensional diffusion 

quation where three steps, each occupying At/3, replace (8.14 and 15). In three 
only conditionally stable. It is necessary that Sx» Sys „5. <1.5 for stability where 
8, —a, At/ 422. и ео 
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8.2.2 Generalised Two-Level Scheme 
We now Seek to generalise the splitting concept. A general two-level implicit finite 
difference scheme for (8.1) can be written | 


4Т]ү! 


At — (1 — BY, Lxx T} tO Ly Тук) 


— В(о, І, То ГТ"+1)=0, where (8.18) 


у-уу 


ATi =T TE. 


Here 4T у! can be thought of as the correction to the solution at time-level n 
in order to advance to time-level (n+ 1). In order to minimize the build-up of 
round-off error it is useful to have 477%! appear explicitly in the computer 
program. The role of В in (8.18) is to weight the time-levels n and (n+ 1); this same 
idea was used in (7.24). 

Equation (8.18) will be manipulated to give an implicit algorithm in 47711. 
First the term 79%! is expanded as a Taylor series about the nth time-level, as in 
(3.16), viz 


oT |" oT? |" 
пт", At| — 0.5 At? ... 
D = Tint E Є |% [+ 


which сап be approximated by 





A 
rist=Thatae( гіа 





)+ O(At)) . (8.19) 


Substituting (8.19) into (8.18) gives 


аты 
At 





—(a,L,. Т), +a,L yy Т), к) 


— pla LATE}! +a, Lp AT11!)-0 , (8.20) 


yLyy 4 


or, after rearrangement, 
[1— BAt(e,L,, +a,L,,)] 4 Tik 1 = (а Lyx +a,Ly) The . (8.21) 


Algebraic operators appropriate to both directions appear on the left hand side of 
(8.21). In order to make use of the Thomas algorithm, (8.21) is replaced by the 


approximate factorisation 


(1— BAta,L,,)(1—BAta, Ly) АТ = Аца, Lys tay Г) T2, (8.22) 
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Comparing (8.21) and (8.22) indicates that (8.22) contains an additional term, on 


the left hand side, 
P? At? x, n, Lyx Ly, A T^i. . 


That is, (8.22) approximates (8.21) to O(At?). 
Equation (8.22) can be implemented as a two-stage algorithm at each time-step. 


As the first stage the following system of equations is solved on every grid-line in 
the x-direction (constant k in Fig. 8.2): 


(1 — ВА, Lyx) AT f. Аца, Leg tay Ly) Т", / (8.23) 


for АТ}, which can be thought of as an intermediate approximation to 477; !. 


When L, is a three-point centred difference operator, (8.23) is a tridiagonal system 
that can be solved efficiently using the Thomas algorithm (Sect. 6.2.2). During the 
second stage, the following system of equations 1$ solved: 


(1—BAta, Lp) ATI} =AT#, (8.24) 


on every grid-line in the y-direction (constant j in Fig. 8.2). The structure of (8.23 
24) is similar to that of the ADI scheme (8.14, 15). For more complicated equations 
the evaluation of all the spatial terms on the right hand side of (8.23) is a major 
contribution to the execution time. The present implementation only requires one 
evaluation of the spatial terms per time-step. In contrast, the ADI scheme requires 
two evaluations. The algorithm (8.23, 24) is due to Douglas and Gunn (1964). 

The two-stage algorithm (8.23, 24) is unconditionally stable for 3 £0.5 and has 
a truncation error of O(At?, Ax?, Ay?) if В=0.5. The approximate factorisation 
construction extends to three dimensions and, in contrast to the ADI scheme, is 
unconditionally stable for 320.5. 

If the present formulation is used to obtain solutions of steady problems as the 


steady-state limit of the transient solution (Sect. 6.4), it is useful to define - 
RHS-(nL, aL) Th, - (8.25) 


in (823). As the steady-state solution is approached RHS tends to zero; thus 


monitoring the value of RHS indicates the proximity to the steady-state solution. 
piste Mis теле P Ntra en ае em ripe tpa aid ш d mai SEDI м, аа. ел вики” MEM 


ха een 


8.2.3 Generalised Three-Level Scheme 


For the one-dimensional diffusion equation a generalised three-level implicit 
scheme was given by (7.25). A corresponding three-level scheme is written in two 
dimensions as 
(1--;)4Т7,! )4Т), 
At At 
+ (о... ta, Ly) Ту! , where (8.26) 


= (1 = В)(«, Lyx + a, L,,) Tk 


АТ = ТТ! . 
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With the same construction as was used to develop (8.23, 24) from (8.18), the 
following two-stage algorithm is obtained from (8.26). During the first stage 





At 
(- А as Ls Thra ® Lyx +91) T3 + АТ", (827) 


(1+5) (1 + 7) 
generates a tridiagonal system of equations associated with each grid line іп the х- 
direction. 

During the second stage of every time-step, the following equation is used: 


(- -gf 4a, атат (8.28) 


For the particular choice $= 1, у=0.5 the two-stage algorithm, given by (8.27, 28), 
is consistent with (8.1) with a truncation error of O(At^, Ax?, Ay?) and is 
unconditionally stable. For the first time-step (п--0), a two-level scheme, such as 
given by (8.23, 24), is required. 

The splitting schemes discussed in this section extend naturally to three 
dimensions (Mitchell and Griffiths 1980, p. 85). The modern approach to splitting 
whereby a term, typically of O(At?), is added to the implicit equation to construct 
the factorisation is discussed at length by Gourlay (1977). Higher-order split 
schemes are possible for the diffusion equation in two and three dimensions. Some 
of these are discussed by Mitchell and Griffiths (1980, pp. 61, 87). 


8.3 Splitting Schemes and the Finite Element Method 


Here we apply the Galerkin finite element method (Sect. 5.3) to the two-dimen- 
sional diffusion equation (8.1) with boundary and initial conditions given by (8.2, 3) 
and determine whether the splitting schemes developed in Sect.8.2 must be 
modified to include the finite element form of the discretised equations. Rec- 
tangular elements are used with bilinear interpolating functions (5.59) in each 
element. If the Galerkin finite element method is applied on a grid that is uniform in 
the x and y directions, the result can be written (after dividing all terms by 4x Ay) as 


aT 
M,@M, Е | =4,M, ® LT; ,-4,M,G Ly Tyr , (8.29) 


where 69 denotes the tensor (or outer) product (Mase 1970, p. 15); М, and М, аге 
directional mass operators and L,, and L, are directional difference operators 
(Appendix A.2). The directional mass operators are 


M,=(, 4 0 and M,=( 3, 3)" , (8.30) 


8.3 Splitting Schemes and the Finite Element Method 257 


and the directional difference operators are 


1 2 1 1 2 1 \7 
ban (зная) апа L,- (25 жы) . (8.31) 


Thus, the term М, ® Lyx 7; , gives a nine-point representation of д? T/Ox?. With 
reference to Fig. 8.3, 


Tj-ikx- 2T; yt Ge ы) 


M, LT gy Ен Hae Tiene 5] 





Ax? 3 Ax? 
11|7,-1.-1-21,,-1:57;1.- 
++ j-1,k-1 j,k-1 j+1,k-1 . (8.32) 
6 Ах? 
kel ----оюы! 
nal + ^ + 
" 
n k - —--dk 
4 7 
k-i 4----...... d'k- 
jd j jd jio j je 
sweep in x-direction sweep in y- direction 
(constant k) (constont 1) 


(о) Finite element formulation 





/ 
a ат 
7 
7 
и 4 7 / 
4----4----«<к- 4----4-- --©к-1 
ін j jel j-1 j . 
sweep in x- direction sweep in y- direction 
(constont К) (constant j) 
Fig. 8.3a, b. Active nodes for split 
(b) Finite difference formulation schemes (8.23, 24); (8.39, 40) 


The finite difference representation introduced earlier, for example (8.4) and 
following, can be incorporated into the framework of (8.29) by defining the finite 
difference directional mass operators as 


Mf? (M13)? = (0, l, 0) , 


so that the finite difference expressions will contain only three terms. 

The major difference between finite element and finite difference discretisation 
in multidimensions is that the finite element method evaluates spatial derivatives 
over a region whereas the finite difference method evaluates spatial derivatives 
along a particular coordinate direction. Generally, the finite element formulation is 
more accurate but also less economical, since more terms must be evaluated. 
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If quadratic Lagrange elements are used instead of linear Lagrange elements 
each of the mass and difference operators contains five terms so that a typical 
tensor product, M, @ L,, T; к, contains 25 terms. However, such an expression is 
consistent with the derivative 227 /dy? being represented to O(Ay?) typically. 


8.3.1 Finite Element Splitting Constructions 


In this section, two means of splitting the finite element equations will be devel- 

oped. The first produces an algorithm similar to the ADI scheme (8.14, 15). The 

second algorithm obtains solutions in terms of the corrections 47; , as in (8.23, 24). 

As with the treatment of the one-dimensional diffusion equation a finite 

difference representation will be introduced for dT /dt in (8.29) and the spatial terms 

will be evaluated as a weighted average of the nth and (n+ 1)-th time-levels. Thus, 
(8.29) is replaced by 
A Т" +1 


ЛЕ 
t 





M,@M, -(а,М, Q Lat M: 8 LEA- AT] + BTI] - (8.33) 
Equation (8.33) is the finite element equivalent of (8.17). The ADI finite element 
splitting is obtained by adding Ata, «,L,, & Ly [B T"*!—(1— pP) T"] to the left 
hand side of (8.33) so that it becomes 


(M,—a,BAtL,,) & (M, — a, BAtLyy) Ту! 
=[M,+4,(1—B)4tL,,] & [M, t a,(1— B)4tL, ] T5 (8.34) 
Equation (8.34) can be implemented very efficiently in two stages. In the first stage, 
(М. —«, BAtL,,) Tf, = [M, +a (1-В) 4 Г] 75, (8.35) 
produces a tridiagonal system on each x gridline. In the second stage, 
(M,—a, BAtL,,) Т 2 [M, +a,(1— B) ALL] Tf, (8.36) 


produces a tridiagonal system оп each y gridline. 

If B=0.5 the only difference between the finite element ADI construction (8.35, 
36) and the finite difference ADI construction (8.14, 15), is the appearance of 
the mass operators in (8.35, 36). The ADI finite element algorithm is econ- 
omical, second-order accurate in time and space and is unconditionally stable if 
B 29.5 -- (6 —0.25)/s where д is defined by (8.44) айа s is s, or s,. 

An alternative splitting can be developed in the following way. The term 75;!, 
on the right hand side of (8.33) is expanded about the nth time-level as a Taylor 
series and terms of O(At?) are discarded. After rearrangement, the result can be 
written 


СМ, & М,- ВАКа, M, 9 Ly, -«, M, Q Г, АТ"! 
= At(x, M, Q Lx +4, М, ® Ly) Thy . (8.37) 
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If the additional term f?At?a,a, Lyx & L,, AT 1! is added to the left hand side of 
(8.37) the following splitting is obtained: 


(M, — Bata, L..) © (M,— BAta,L,,) АТ"! 
= At(x, M, Q Ly, +а M, ® Ly) T7, , (8.38) 


which is implemented as 

(M, — BAta, Lyx) AT $= At(a, M, O Lox ta M, & L)T3, (8.39) 
for all grid lines in the x-direction and as 

(M,— Bata, Г) АТ"! 2 ATE, (8.40) 


for all grid lines in the y-direction. 

Equations (8.39 and 40) may be compared with (8.23 and 24). The right hand 
side of (8.39) takes longer to evaluate than the right hand side of (8.23). The right 
hand sides are evaluated at time-level n. As can be seen from Fig. 8.3, nine nodes are 
active at time-level n for the finite element method compared with five for the finite 
difference method. However, the evaluation of (8.39) and the solutions of the 
tridiagonal systems of equations is negligibly different for the two formulations. 
Computational experiments indicate that the finite element method is slower, 
overall, than the finite difference method but usually provides more accurate 
solutions. 

The computational algorithm (8.39, 40) is consistent with (8.1) with a truncation 
error of O(AC, Ax?, Ay?) if B =0.5. The same splitting (8.39, 40) would be obtained if 
quadratic Lagrange elements (Sect. 5.3.2) were used instead of linear Lagrange 
elements, but the operators M,, L,,, etc, would have a maximum of five terms 
instead of three (Fletcher 1984, p. 122). 


8.3.2 TWDIF: Generalised Finite Difference/Finite Element Implementation 


The various splitting schemes considered іп Sects. 8.2.2, 8.2.3 and 8.3.1 can be 
combined into a single generalised three-level scheme that embraces both finite 
difference and finite element formulations. This will be demonstrated here for the 
region shown in Fig. 8.1. 

The temperature distribution in the domain Ox x1, 0x y 1 is governed by 
(8.1) with the particular boundary conditions 


T (0, y, t)=20+80y , 
T(1, y, 2) 204-80 y —e795*** sin(0.52y)] , (841) 
T(x,0,2) 20 , 

T (x, 1,0) 220--80[ y — e^ 954 sin(0.5nx)] . 
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For this example, the following analytic solution is available: 
T —20--80[ y—e 954" 5іп(0.5лх)ѕіп(0.5лу)}. (8.42) 


The generalised scheme can be introduced by applying a three-level finite difference 
discretisation to (8.29) with a=a, =a, to give 


At 
=a(M, @ Lyx - M. & LE-A) T7 ,-- BTI]. 


M, & M, 


(8.43) 


which is equivalent to (8.26). However, here the directional mass operators (8.30) 
will be generalised to 


M,— M7 - (6, 1-25, б}. (8.44) 


Clearly, the finite difference scheme, Sects, 8.2.2 and 8.2.3, corresponds to 6 =0 and 
the finite element scheme, Sect. 8.3.1, corresponds to ô= 1/6. 

The approximate factorisation of (8.43) produces the following two-stage 
algorithm in place of (8.39, 40). 


First stage: 
м-( Фа | ATf,-RHS", where (8.45) 
О м | 
RHS" - [a4t(M, Q L,, +M, Q Ly) T1 4H-yM, ® M,ATS,]/[1--y] . (8.46) 





Second stage: 


At 
ЕЕ 3 Ly |ату'=4ту, | (8.47) 


For equations formed at points adjacent to boundaries, boundary values of 
AT * in (8.45) and AT" * ! in (8.47) must be provided. The appropriate form for these 
boundary values can be determined most easily by forming (8.45, 47) into a single 
composite scheme by eliminating AT'*. It is then clear that boundary values for 
AT"*! required to solve (8.47) can be obtained directly from (8.41), whereas 
boundary values for 47% must be computed from (8.41) via (8.47) reversed, e.g. on 
j-NX(x- I), 





r 


«pAt 
AT Ñx. = (м, ш (E5) ь | 4 T NX . (8.48) 


If the boundary conditions (8.41) are not time dependent then zero boundary 
values for 47% and A7"*! are generated. Known boundary values of dependent 
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variables, 47% and AT"*!, are transferred to the right hand side of (8.45) and 
(8.47), respectively, before solution. 

The above algorithm is implemented in program TWDIF (Fig. 8.4). In the 
first stage, (8.45) generates tridiagonal systems of equations associated with each x- 
gridline. These are solved using subroutines BANFAC and BANSOL (Figs. 6.18 
and 6.19). Equation (8.46) is evaluated in subroutine REDIF (Fig. 8.5). In the 
second stage, (8.47) produces tridiagonal systems of equations associated with each 
y gridline which are solved using subroutines BANFAC and BANSOL. The major 
parameters used in program TWDIF are defined in Table8.1. Typical output 
for the finite difference Crank-Nicolson scheme on a coarse grid is shown in 
Fig. 8.6. 

Program TWDIF is used to compare the Crank-Nicolson, two-level fully 
implicit (2LFI) and three-level fully implicit (3LFI) finite difference schemes with 
the Crank-Nicolson finite element scheme. The results are summarised in Table 8.2. 
For the three-level fully implicit scheme (ЗЕ. ЕТ) the first step is evaluated using the 
Crank-Nicolson scheme. Also included in Table 8.2 are rms errors obtained with a 
modified Crank-Nicolson scheme with ó — 1/12 in (8.44). 

The Crank-Nicolson and 3LFI finite difference schemes are of comparable 
accuracy particularly on a fine grid. This is to be expected since both schemes have 
second-order truncation errors. The very accurate 3LFI solution on a coarse grid is 
considered to be an anomaly. The two-level fully implicit scheme (2LFI) is 
considerably less accurate, and this is consistent with its truncation error of O(At, 
Ax*, Ду?). The Crank-Nicolson finite element scheme has a second-order trunc- 
ation error and demonstrates comparable accuracy to the Crank-Nicolson finite 
difference scheme. However, а Crank-Nicolson scheme with 6=1/12 in (8.44) 
produces a significantly more accurate solution on both coarse and fine grids. This 
is to be expected since this scheme has a truncation error of O(At?, Ax*, Ay*). 

The schemes shown in Table 8.2 are coded as nine-point schemes. However, 
subroutine REDIF can be evaluated more efficiently for finite difference schemes 
by recognising that M, and M, only contain one non-zero entry each. Conse- 
quently, the non-zero contributions to DMX and DMY in subroutine REDIF can 
be grouped into three-component vectors. 

Although the finite element scheme is formally a nine-point scheme, the 
symmetry of the mass operators (8.44) and the requirement of evaluating (8.46) only 
on sweeps in the x-direction permit the following economical procedure to be 
employed. First (8.46) 15 rewritten as 


3 


RHS= Y 41, MYT,--M,,LYT,) , where (8.49) 
vel 
adt k+1 
M Т,. = — , " 8.50 
Yh (ш) У м, Tin: (8.50) 


t k+1 k+1 
күт, -( E) Туу 225) У м,„4Т",, (8.51) 
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1c 
2c TWDIF APPLIES APPROXIMATION FACTORISATION TO SOLVE 

3c THE UNSTEADY HEAT CONDUCTION PROBLEM FOR T(I,Y) . 

ac REDIF EVALUATES THE RIGHT-BAND SIDE. 

5c 

6 DOUBLE PRECISION SUMT,RMST,DSQRT,AN 

7 DIMENSION T(41,41),DT(41,41) ,R (41,41) ,EMY (3) ,EMY (3) 

8 1B(5,41), RRT (41) , DDT (41) , SIE (412 , SIY (41) , Y (41) , ERR 41,41) 
9 COMMON DX,DY,EMX,EMY,NX,NY,R,T,DT 

10 OPEN(1,FILE-'TWDIF DAT’ ) 

11 OPEN(6,FILE=’TWDIF. OUT’ ) 

12 READ(1,1)IBC,NX,NY,ME,GAM,BET 

13 READ(1, 2) ALF ,DTIM, DTMCH, TMAX 


14 1 FORMAT(4IS,2F5.2) 
15 2 FORMAT(4E10.3) 


16 с 

17 РІ = 3141592654 

18 РІН = 0.5*PI 

19 WXS = NX +1 

20 FYS = ШҮ +1 

21 WXP = NX-1 

22 WYP = NY -1 

23 IF(IBC EQ. 1)NIN = FIP 

24 IF(IBC .EQ. 1)NYM = NYP 

25 IF(IBC .Е0. 2)NIN = ИХ 

26 IF(IBC .EQ. 2)NYN = NY 

27 NXPP = NXN - 1 

28 NYPP = WYN - 1 

29 AN = NIPP*NYPP 

30 АЙХ = ИІР 

31 DX = 1./ANX 

32 ANY = БҮР 

33 DY = 1./ANY 

34 SX = ALF*DTIM/DX/DX 

35 SY = ALF*DTIM/DY/DY 

36 EMX(1) = 0. 

37 IF(ME .EQ. 2)EMI(1) = 1./6. 

38 IF(ME .EQ. 3)ЕМХ(1) = 1./12. 

39 EMX(2) = 1. = 2.*EMI(1) 

40 EMX(3) = EMX(1) 

41 203 J = 1,3 

42 3 EMY(J) = EMX(J) 

43 УЕТТЕ( 6,4) ИХ, ИУ, МЕ, САМ, ВЕТ, ЗХ,5У 

44 4 FORMAT(? UNSTEADY HEAT CONDUCTION WITH ШХ,ИҮ -',213,/, 
45 1’ МЕ -',I3," GAM -”,Е5.2," BETA -',F5.2," SX,SY =’, 2F6.3) 
46 IF(ME .EQ. 1)WRITE(6,5)EMX 

47 5 FORMAT(" APPROX, FACT., ЗРТ РОМ, ЕМХ=’,3Е10,3) 
48 IF(ME .EQ. 2)ҸВІТЕ(6,6)ЕМХ 

49 6 FORMAT(' APPROX. FACT., LINEAR FEM, ЕМХ=',3Е10.3) 
50 IF(IBC .EQ. 1)VRITE(6,7) a” 

51 IFCIBC .EQ. 2)WRITE(6,8) 


52 7 FORMAT(' DIRICHLET В.С.?) 
53 8 FORMAT(^ DIRICHLET AND NEUMANN B.C.) 


54 WRITE(6,9)DTIM, DTMCH, TMAX 

55 8 FORMAT(' DTIM z',E10.3,' PRINT INT. s',E10.3,' TMAX =',Е10.3,//) 
56 C 

57 C GENERATE INITIAL SOLUTION 

58 C 

59 DO 11 J - 1,NX 

60 ÀJ12J-71 

61 X = AJ*DX 

62 SIX(J) = SIN(PIH*I) 


Fig, 8.4. Listing of program TWDIF 


100 
101 
102 
103 
104 
105 
106 
107 
108 
109 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
124 
125 


a 


10 


11 


12 


13 
14 


15 
16 


17 
18 


19 


20 
21 
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DO 10 К = 1, ШУ 

aK = K- 1 

Ү(К) = aKeDY 

SIY(K) = ЅІЙ(РІЯФҮ(К)) 

T(K,J) = 80.%(У(К) - SIX(J)*SIY(K)) + 20. 
CONTINUE 

IF(IBC .EQ. 1)GDTO 14 

ро 12 J = 1,NX 

T(NYS,J) г T(NYP,J) + 160.*DY 

DO 13 К = 1,NYS 

Т(К,ИХ5) = T(K,WXP) 

CONTINUE 

САМЕ = САИ 

САМ = 0. 

ВЕТН = ВЕТ 

ВЕТ = 0.5 

ТІМ = 0. 

ТИСЕ = -0.00001 + DTMCE 
DIM = - O.5*ALF*PISPI 
DO 17 J = 1,NI 

00 15 K = 
DT(K,J) 
R(K,J) = о. 

DO 16 К = 1,5 

B(K,J) = O. 

CONTINUE 

ССХА = BETSDTIM*ALF/DX/DX/(1.*GAM) 

CCYÀ = BET*#DTIM*ALF/DY/DY/(1.+GAM) 

ТҒАС = EXP(DIM*(TIM+DTIM)) - EXP(DIM*TIM) 


ни 
о ы 
т. 
= 
< 


CALL REDIF (АСР , GAM , DTIM, WXN , WYN) 
TRIDIAGONAL SYSTEMS IN THE X-DIRECTION 


DO 21 К = 2,NY" 
DO 19 J = 2,NXN 


И = 1-1 

B(2,JM) = EMX(1) - ССХА 
B(3,JM) = ЕМХ(2) + 2.*CCXA 
B(4,JM) = EMX(3) - CCIA 
RRT(JM) = R(K,J) 

B(2,1) = 0. 


IF(IBC .EQ. 1)DT(K,NX) = - 80. 95Іү(К) ТРАС 

ІЕСІВС .EQ. 1)DUM-(EMY(1)-CCYA)*SIY(K-1)* (EMY(2) *2. *CCYA) *SIY(K) 
1 + (EMY(3)-CCYA) *SIY(K*1) 

IF(IBC .EQ. 1)RRT(JM) = АКТОМ) + B(4,JM)*80. *TFAC*DUM 

IF(IBC .EQ. 2)DT(K,NXS) = DT(K,NXP) 

IF(IBC .EQ. 2)B(2,JM) = B(2,JM) + В(4, ЈИ) 

B(4,JM) = 0. 


CALL BANFAC(B,WXPP,1) 
CALL BANSOL(RRT,DDT,B,WXPP,1) 


DO 20 J = 2,NXN 
ІМз1-1 
R(K,J) = DDT(JM) 
CONTINUE 


TRIDIAGONAL SYSTEMS IN THE Y-DIRECTION 
ро 24 J = 2,NIN 


20 22 К= 2,NYN 
Ku = K- 1 


Fig. 8.4. (cont.) Listing of program TWDIF 
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126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 
142 
143 
144 
145 
146 
147 
148 
149 
150 
151 
152 
153 
154 
155 
156 
157 
158 
159 
160 
161 
162 
163 
164 
165 
166 
167 
168 
169 
170 
171 
172 
173 
174 
175 
176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 
188 
189 


e 


e 


22 


23 
24 


25 


26 
27 


28 
29 


30 


31 
32 
33 


34 
35 


36 


37 
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B(2,KM) = EMY(1) - CCYA 

B(3,KM) = EMY(2) + 2.*CCYA 

B(4,KM) = EMY(3) - ССҮА 

RRT(KM) = R(K,2) 

B(2,1) = 0. 

IF(IBC .EQ. 1)DT(NY,J) = - 80.*SIX(J)*TFAC 

IF(IBC .EQ. 1)RRT(KM) = RRT(KM) - B(4,KM)*DT(NY, J) 
IF(IBC .EQ. 2)DT(NYS,J) = DT(NYP,J) 

IF(IBC .EQ. 2)B(2,KM) = В(2,КМ) + B(4,KM) 

B(4,KM) - O. 


CALL BANFAC(B,NYPP,1) 
CALL BANSOL(RRT,DDT,B,BYPP,1) 


INCREMENT T 


DO 23 К = 2,ШҮЙ 

KM = K-1 

DT(K,J) = DDT(KM) 

T(K,J) = T(K,J) + DDT(KM) 
CONTINUE 

IF(IBC .EQ. 1)DT(NY,NX) = 
IF(IBC .EQ. 2)DT(NYS, EXS) 
TIM = TIM + DTIM 

TFAC = EXP(DIM*TIM) 
IF(IBC .EQ. 2)GOTO 26 

DO 25 К = 1,37 

T(K,NX) = 20. + 80.*(Y(K) - TFAC*SIY(K)) 
Т(ИҮ,К) = 20. + 80.*(1. - TFAC#SIY(K)) 
сото 29 

DO 27 J = 1,NX 

Т(ШҮЅ,Ј) = Т(ИҮР,Ј) + 160.*DY 

DO 28 К = 1,115 

T(K,NIS) = T(K,NIP) 

GAM - GAME 

BET = BETE 

IF(TIM .GE. TMCH)GOTO 30 

IF(TIM .LE. TMAX)GOTO 18 

GOTO 37 


80.*TFAC 
DT (WYP , NXP) 


COMPARE VITH EXACT SOLUTION 


TMCH = TMCH + DTMCH 

SUMT > 0. 

DO 33 KA = 1, МУ 

Ке1%ИҮ-Кі 

DO 31 J= 1,1 

ТЕ = 20, + 80.*(Y(K) - TFAC*SIX(J)*SIY(K)) 
ERR(K,J) = T(K,J) - ТЕ 

SUMT = SUMT + (T(K,J) - TE)##2 
WRITE(6, 32) (T(K, J), J=1 8X) 
FORMAT(? T=',11F7.3) 

CONTINUE 

DO 34 KA = 1,NY 

K - 1 * NY - KA 

WRITE(6,35) CERR(K,J) ,J=1, 0X) 
РОВМАТ(? E-',11F7.3) 

RMST = DSQRT(SUMT/AM) 

WRITE(6 ,36)TIM,RMST 

FORMAT(’ TIME =’,E11.4,’ RMST =’ ,D11.4) 
IF(TIM .LE. TMAX)GOTO 18 
CONTINUE 

STOP 

END 


Fig. 8.4. (cont.} Listing of program TWDIF 
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SUBROUTINE REDIF(ALF,GAM,DTIM,NIN,NYN) 


EVALUATES RIGHT-HAND SIDE OF THE 
UNSTEADY HEAT COXDUCTION EQUATION 


сусу сус) 


DIMENSION DT(41,41),T(41,41),R(41,41), 
1DMX (3,3) ,DKY (3,3) , ENX (3) , EMY (3) 
COMMON DI,DY,EMX,ENY,NX,NY,R,T,DT 
ССІ - ALF/DX/DX 
CCY = ALF/DY/DY 
D0 1 I= 1,3 
DMX(I,1) = CCX«EMY(I) 
DMX(I,2) -2,*DMX(I,1) 
DMX(I,3) = DMX(I,1) 
DMY(1,I) = CCY*ENX(I) 
DMY(2,I) = -2,*DMY(1,I) 
1 DKY(3,1) = DMY(1,I) 


2, ИХИ 
2, Шуи 


eee eee е 
шю бо - anton o Q шю OO - DAO AWD ы 


ы 
ma 
ж 
d 
н 

о 


D0 3 M> 1,3 
И):1-2%И 
202 и = 1,3 
EK :K-2*J 
2 RTD = RID + (DMX(N,M)*DMYCI, M) )*TCEK, MJ) eDTIM 
1 + GAM«EMX CH) *EMY (W) *DTCIK, MJ) 
3 CONTINUE 
4 R(K,J) = RTD/(1. + GAN) 
5 CONTINUE 
RETURN 
END Fig. &5, Listing of subroutine REDIF 


ГУЕ 


Fable 8.1. Parameters used by program TWDIF 


Parameter Description 
ALF a=a,=a, in (8.1) 
GAM, BET у, В in (8.43) 
EMX, EMY M,, M, in (8.44) 
IBC = 1, Dirichlet boundary conditions everywhere 
= 2, Neumann boundary conditions at x = 1 and у = 1 
ME =1, finite difference method; 
=2, linear finite element method 
NX, NY maximum number of grid points in the x and y directions 
DX, DY Ax, Ay 
SX, SY 5,» 5, after (8.5) 
DTIM At 
DTMCH increment of time for printing output 
TIM time 
TMCH comparative time for determining printing 
TMAX maximum time 
T nodal temperature, 7%, in (8.43) 
TE Тіп (8.42) 
DT АТ, in (8.46) 
DDT АТ*, and 4711 on return from BANSOL 
B tridiagonal matrix; left-hand sides of (8.45) and (8.47) 
R RHS" in (8.46); also used to store ATF, іп (847) 
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UNSTEADY HEAT CONDUCTION WITH NX,NY «6 6 

ME = 2 GAM = .00 BETA = .50 SX,SY = 1.000 1.000 

APPROX. FACT., LINEAR FEM, EMX= .167Е+00 .667Е%00 .167Е+00 
DIRICHLET В.С. 

DTIM а .400E+04 PRINT INT. = .160Е+05 ТНАХ = .161Е+05 


Т=100.000 88.775 78.650 70.614 65.455 63.677 

T= 84.000 73.351 63.742 56.109 51.193 49.455 

T= 68.000 58.954 50.790 44.300 40.109 38.614 

Та 52.000 45.433 39.505 34.790 31.742 30.650 

T= 36.000 32.549 29.433 26.954 25.351 24.775 

T= 20.000 20.000 20.000 20.000 20.000 20.000 

E»  .000  .000 .000  .000 .000 .000 

E»  .000 .026 .047  .057  .047 .000 

Es  .000 .035 .062  .074 .057 .000 

E»  .000 .031  .054  .063 .047 .000 

Es  .000 .017 .031  .035  .026 .000 . . 

E=  .000  .000  .000  .000  .000 .000 Fig. 8.6. Typical output produced by 
TIME = .1600E+05 RMST = .AT10D-01 program TWDIF 


Table 8.2. rms errors for Dirichlet boundary conditions at t = 24,000 with a= 1 x 1075 and s, = $, = 1.00 


Crank- Crank- Crank- 

Nicolson, 2LFI 3LFI Nicolson Nicolson 

FDM(ME-1) FDM(ME-1) FDM(ME-l) FEM(ME-2) ô=1/12 
Ax=Ay y=0, B=0.50 y=0, B=1.00 y20.5,8—1.00 y=0, )--0.50 y=0, f20.50 


0.20 0.02682 0.40880 0.00306 0.03364 0,002820 
0.10 0.00648 0.08874 0.00528 0.00686 0.000134 
0.05 0.00157 0.02065 0.00150 0.00159 0.000082 


and т-т--К--2, l'Zl—j--2. MYT and LYT are three-component arrays 
associated with grid points j — 1, j and j + 1 when RHS is centred at node ( j, k). The 
structure of (8.49) follows from evaluating all y operators in (8.46) first. 

However, the evaluation of MYT and LYT from (8.50, 51) is the same whether 
the Galerkin node is located at j — 1, j or j + 1. Consequently, it is more efficient to 
evaluate only MYT, using (8.50) and only LYT, using (8.51). Values of MYT, and 
МҮТ, required in (8.49) are obtained by passing along the values of МҮТ; and 
MYT, from the evaluation at the previous Galerkin node, i.e. j — 1 instead of j. 
LYT, and ГУТ, are obtained іп an equivalent manner. 

The above technique has been used by Fletcher and Srinivas (1984) in obtaining 
the incompressible viscous flow past a backward-facing step using the stream 
function vorticity formulation described in Sect. 17.3.3. 


8.4 Neumann Boundary Conditions 


In Sects. 8.2 and 8.3, the splitting schemes are described and implemented with 
Dirichlet boundary conditions. For the one-dimensional diffusion equation first- 
and second-order implementations of Neumann boundary conditions are con- 
sidered in Sect. 7.3.1. 
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Here, procedures will be described for combining Neumann boundary con- 
ditions with the splitting schemes of Sects. 8.2 and 8.3. A typical multidiffusion 
problem has mixed Dirichlet/Neumann boundary conditions. To provide a con- 
crete example, the first and third boundary conditions in (8.2) are retained but the 
second and fourth boundary conditions are replaced with 


ӘТ ôT 
дх (1, У, t) =9(y, t) and 2у % 1, t) =h(x, t) , (8.52) 


where g( y, t) and A(x, t) are known functions. 


8.4.1 Finite Difference Implementation 


The various schemes considered in Sects. 8.2 and 8.3 are spatially second-order 
accurate. А second-order accurate implementation of the Neumann boundary 
conditions (8.52) is 


Tisi Т-к Тұқа- Tiki 
АН and Ду -h0. (8.53) 


The boundary conditions (8.53) will be implemented in conjunction with the 
generalised two-level scheme, Sect. 8.2.2. 

For points lying on the boundaries x = 1 and у= 1, the right hand side of (8.23) 
can be evaluated after (8.53) has been used to generate additional points 75, ,,, and 
T;,,;, that lie just outside the computational domain proper. However, the 
operators on the left hand sides of (8.23 and 24) require the construction of 
corrections AT f, , , and AT 5,1,. These can be obtained from (8.53) by evaluation 
at successive time intervals, i.e. 


АТ" = АТУ +24х Agg! , where (8.54а) 
49"*1=9'*1—9" and (8.545) 
АТ" = АТ" +2AyAht*? . (8.54с) 


The splitting into (implicit) one-dimensional operators then requires modification 
of the relevant components of L,, and L,,, using (8.54). Thus, at j= NX the 
introduction of (8.54a) leads to the following equation replacing (8.23) during the 
first stage: 


(1-о, BAtL™)AT#,=At(agligx xy Ly) T7, 
—2Ax(1 a, BAtL, S) (1 -4 At Ly) Agh*! , (8.55) 


where Г" = {2, —2, 0)/Ах2, However, during the second stage, (8.24) is used 
without modification. At k= NY, (8.23) is used without modification during the first 
stage. However, during the second stage, (8.24) is replaced by 
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(1—a,BAtL™) AT*t!=AT#,—(1—a,BAtL,y3) 2AyAh T?! (8.56) 


where L”,={0, —2, 2}7/Ay? . 

The above procedures have been applied to the example considered in 
Sect. 8.3.2 with ô T/ôx(1, y, t) 0 and дТ/ду(х, 1,t)=80, replacing the relevant 
Dirichlet boundary conditions. However, for this particular choice of boundary 
conditions д and h іп (8.52) are constants so that the right hand sides of (8.55 and 
56) coincide with the right hand sides of (8.23 and 24). For the present case, the 
exact solution is given by (8.42). Solutions have been obtained with the schemes 
indicated in Sect. 8.3.2; the rms errors are summarised in Tabie 8.3. 


Table 8.3. rms errors for Dirichlet/Neumann boundary conditions at t= 24,000 with a= 1 x 1075 and 
$: =5, = 1.00 


Crank- Crank- Crank- 
Nicolson 2LFI 3LFI Nicolson Nicolson 
ЕОМ(МЕ =1) ЕОМ(МЕ = 1) FDM(ME = 1) ЕЕМ(МЕ = 2) б= 1/12 
Ах(-Ау) у=0, В=0.5 у=0, f 21.0 y20.5, В=1.0 y=0, f 20.5 у=0, f 20.5 
0.20 0.12940 1.58100 0.03985 0.15720 0.01341 
0.10 0.03199 0.41120 0.02640 0.03358 0.00070 
0.05 0.00779 0.10080 0.00744 0.00788 0.00021 


Trends are similar to those associated with Dirichlet boundary conditions 
(Table 8.2) except that the rms error levels are larger for all methods when 
Neumann boundary conditions must be satisfied. А typical error distribution is 
shown in Fig. 8.7. It is clear that the largest errors are occurring on and adjacent to 
boundaries where Neumann boundary conditions are applied. А similar effect was 
apparent (Table 7.9) with the one-dimensional diffusion equation and with the 
Sturm-Liouville problem (Sect. 5.4. The largest errors when all boundary 


UNSTEADY HEAT CONDUCTION WITH NX,NY = 6 6 

МЕ = 1 САМ = .00 ВЕТА = .50 SX,5Y = 1.000 1.000 

APPROX. FACT., ЗРТ РОМ, EMX* .000Et00 .100Et01 .000Е+00 
DIRICHLET AND NEUMANN В.С. 

DTIM = .400Е+04 PRINT INT. = .240E+05 TMAX = .250E+05 


Т=100.000 92.370 85.487 80.024 76.517 75.309 

T= 84.000 76.743 70.197 65.002 61.667 60.517 

T= 68.000 61.827 56.259 51.839 49.002 48.024 

T= 52.000 47.515 43.469 40.259 38.197 37.487 

T= 36.000 33.642 31.515 29.827 28.743 28.370 

T= 20.000 20.000 20.000 20.000 20.000 20.000 

Ұл „000 -.067 -.127 -.174 -.205 -.216 

Ез „000 -.063 -.121 -.166 -.195 -.205 

Ба .000 -.054 -.103 -.141 -.166 -.174 

Бл .000 -.039 -.074 -.103 -.121 -.127 

E= “000 -.0 -. -. -. -. . . А m 
E»  .000 1000 “O00 “O00 “000 “O00 Fig. 8.7. Error distribution with Dirichlet/ 


TIME = .2400Е%05 RMST = .1294D+00 Neumann boundary conditions 
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conditions are of Dirichlet type occur in the interior at the points furthest from the 
boundaries, 

An interesting feature of the results shown in Table 8.3 is the high accuracy 
associated with the “fourth-order” interior scheme, 6=1/12 in (8.44). It may be 
recalled that, for the one-dimensional diffusion equation (Sect. 7.3.2), the intro- 
duction ofa second-order accurate boundary condition limited the accuracy even if 
a higher-order interior scheme was used. 


8.4.2 Finite Element Implementation 


For governing equations, such as (8.1), in which second derivatives are present the 
implementation of Neumann boundary conditions enters into the finite element 
discretisation process. This was illustrated for the one-dimensional Sturm-Liouville 
problem in Sect. 5.4.1. 

Application of the Galerkin finite element method to (8.1) produces the 
following weighted integral statement (Sect. 5.1): 


19.27 dx dy= aff e (s 214 -+ Le 5 ndr. (8.57) 


In contrast to the development given in Sect. 5.1, the approximate solution for Т 
has not yet been introduced in (8.57). Here ф„ is the weight function associated with 
the Galerkin node. Two cases are of interest in relation to implementing Neumann 
boundary conditions. First, m is a node on the boundary х=1 (Fig. 8.1); second, m 
is а node on the boundary y=1. It will be assumed that ф, is a bilinear 
interpolating function (Sect. 5.3) although the approach is applicable to any order 
of interpolation. Because of the local nature of @,,(x, y) only two elements make а 
non-zero contribution to the eventual evaluation of (8.57). The two cases of interest 
are shown іп Fig. 8.8. 
The terms on the right hand side of (8.57) are integrated by parts. That is, 


227 ЭТ |^ 0$, 0T 
Ф.т ixdy- J| = | 4] ddr. (8.58) 


mth (Golerkin) node 


1-1 М jel 





N m'h(Golerkin} elem.C elem. D 
node 


Fig. 8.8. Implementation of Galerkin finite element method at 
boundaries. (a) boundary x= 1; (b) boundary y=1 
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Пө теа, Фо, | &-[[ Фес aay вз) 


For the boundary х= 1 the evaluation over the line integrals in (8.58 апа 59) only 
produce a net non-zero contribution on boundary R in each element. The 
weighting function фи is zero on boundaries В and L of element С and Г ahd T of 
element D. The line integral contribution from boundary B of element D identically 
cancels the contribution from boundary T of element C. 

A conventional approximate solution is introduced for T appearing in the area 
integrals 


T=}, ф(х, y)f, . (8.60) 


From (8.52), 0T/Ox|,-, —g(t). А one-dimensional interpolation of g(t) is intro» 
duced by 


g(t)- 9; $7(у) g,(t) - (8.61) 
4 


For Lagrange interpolation, ф.(х, у)-Ф%х) фУ(у). Evaluation of the various 
integrals іп (8.57-59) оп a uniform grid and division by Ax Ay gives 


|^ & M. ог], БЕН +M, 69 Іш Т) „)+ьм, @ Ly га . (8.62) 





У ді 4 


The structure of (8.62) is the same as the interior form (8.29) except for the 
additional term а, M, g,/ Ax associated with the Neumann boundary condition. But 
since only two elements contribute, the operators M, and Lyx have the form 


11 1 1 
м. of and ыт iar 722” o} . (8.63) 


Operators M, and L,, are the same as in (8.30 and 31). 

Implementation of the Neumann boundary condition on у=1 follows an 
equivalent path except that the line integrals 1n (8.58, 59) only produce a net non- 
zero contribution on boundaries Т in elements C and D in Fig. 8.8b. The result 
equivalent to (8.62) 1s 


дт М.В, 
|м. & M, a | =[вм,®ыть+®( Нем. Ты} |. (8.64) 


In (8.64) operators M, and L,, are given by (8.30 and 31) but M, and L,, have the 
components 


м-0 1V аа ре LU (8.65) 
у 376 ап yy , Ау?” Ay? : t 
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It is of interest to compare the treatment of Neumann boundary conditions by 
the finite difference and finite element methods. The equivalent finite difference 
form to (8.29) is 


oT 
Е =d; Lye Typ tty Lyy Tiko (8.66) 
Pk 


where L,, and L,, are given by (8.31). Introduction of (8.53) on x 21 produces the 
following local form for (8.66): 


ôT g 
Fae eae т.) +a, Ly Тук, (8.67) 


where Lys is now given by (8.63) and L,, by (8.31) as before. 

In introducing the concept of mass operators it has been noted (Sects. 5.5 and 
8.3.1) that an equivalent finite difference representation сап be constructed by 
lumping the mass operators, Thus, in (8.62) the mass operators would be 


МЕ-10, 0.5, 0} апа МЇ#={0,1,0}7. (8.68) 


The resulting form of (8.62) is identical with (8.67). Thus, for bilinear Lagrange 
interpolation on rectangular finite elements, it is possible to implement Neumann 
boundary conditions with the Galerkin finite element method by introducing an 
additional set of points T;,,,, and 7, ,, , given by (8.53) and to apply an interior 
formula, such as (8.29), throughout. It should be stressed that although this 
approach is expedient for coding efficiency, it should only be used after the 
equivalence has been demonstrated. 

Typical results using the finite element method with Neumann boundary 
ӘТ/дх(1,у, t)=0 and дТ/ду(х, 1, t)=80, are shown in Table 8.3. The trends аге 
similar to those with Dirichlet boundary conditions except that the errors are 
generally larger when Neumann boundary conditions are present. This same trend 
was apparent with the finite difference method. 


8.5 Method of Fractional Steps 


For the implicit methods described in this chapter the overall strategy has been to 
discretise and then to manipulate or modify the resulting algebraic equations to 
generate the ‘one-dimensional’ algorithms like (8.27 and 28). 

An alternative strategy to the above is fo split the governing equation, for 


example (8.1) into a pair of equations, each of which is locally one dimensional. 
Thus, (8.1) is replaced by 


T 
0,5---< ap and (8.69) 
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ôT „2° Т 
0.5 377 ax? =0. (8.70) 
The equations аге discretised and solved sequentially at each time-step. This class 
of methods was developed by Soviet mathematicians and is described in detail by 
"Yanenko (1971) and Marchuk (1974). Mitchell and Griffiths (1980, pp. 70—74) refer 
to this class of methods as Tocally one-dimensional methods. 
An explicit implementation of (8.69, 70) is 


mee +a,4tL,,)T%, and (8.71) 
а, At Lya) Th"? . (8.72) 


This scheme coincides with (8.8 and 9) if «=a,=«,. The algorithm (8.71, 72) has а 
truncation error of O( At, Ах2, Ay?) and, if Ax — Ay, is stable for 550.5. 
A Crank-Nicolson (implicit) implementation of (8.69 and 70) is 


(1-O0.5a, 4tL,,) T2717? =(1+0.5 a, АИ.) T7, (8.73) 
and | 
(1-0.5a,4tL,,) Г"! =(1 +0.5 a, 4tL,,) Т? . (8.74) 


Equations (8.73, 74) lead to tridiagonal systems of equations along y and x gridlines 
respectively; consequently, the solution can be advanced in time economically 
using the Thomas algorithm. The Scheme (8.73, 74/5 second-order accurate in time 
and space with appropriate boundary condition specification and unconditionally 
Stable in two and three dimensions. 

Equations (8.73 and 74) can be combined into a single composite scheme by 
eliminating Т? ү 7 as long as the L,, and L,, operators commute. That is, the same 
formula is produced by L.,L,,T5,!? as by ГГ». Туи. The resulting com- 
posite scheme coincides with the ADI composite "scheme ‘produced by eliminating 
T f, from (8.12 and 13). 

However, a major difference occurs in the treatment of boundary conditions. 
For the method of fractional steps applied in the two-dimensional domain shown 
іп Fig. 8.1 a Dirichlet boundary condition on x=1, Т(1, y, t)=b(y, t) is imple- 
mented, traditionally, at the intermediate time-level as 





TR -btu А (8. 75) 


апа similarly for boundary conditions on other boundaries. This treatment effec- 
tively reduces the accuracy of the overall ll scheme to first order in time. 

Dwoyer and Thames. (1981): examine the problem of correctly implementing 
boundary conditions in conjunction with a two-dimensional transport equation 
(Sect. 9.5). Mitchell and Griffiths (1980) show that the correct boundary condition 


on x — 1, when using (8.71 and 72), is 


TAM =(1+4a,AtL,,) BL. (8.76) 
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This suggests that, when using (8.73, 74), compatible intermediate Dirichlet 
boundary conditions on x=1 may be obtained by solving 


(1—0.5 a, AtL,) TREY? =(140.5 a, AtL,,)b% . (8.77) 


It may be noted that the method of fractional steps does not provide an 
economical algorithm in terms of the corrections A771! , as was possible with 
approximate factorisation (Sect. 8.2.2 and following). Also, the method of fractional 
steps does not provide a direct evaluation of the steady-state residual (8.25), which 
is important when solving steady problems with a pseudotransient formulation 
(Sect. 6.4). 


8.6 Closure 


Multidimensional explicit methods are appropriate if small time-steps are necessary 
to obtain a sufficiently accurate solution. In addition it is often easier to apply a 
multidimensional explicit method efficiently on a parallel-processing computer. 
For multidimensional parabolic partial differential equations, e.g., the diffusion 
equation, implicit schemes are often more effective than explicit schemes, primarily 
due to their inherently more stable behaviour, particularly if the accuracy of the 
spatial solution is more critical than that of the temporal solution. Additionally, an 
implicit formulation provides more flexibility for constructing higher-order 
Schemes (Mitchell and Griffiths 1980, p. 61). - 


To retain the economy of the Thomas algorithm with a multidimensional 
implicit formulation, it is necessary to introduce some form of directional splitting. 


The recommended construction is to cast the equations as a linear system for the 
correction 477; ! and to introduce an approximate factorisation, e.g. (8.22), which 
permits a multistage algorithm to be applied with each stage requiring the solution 
of a tridiagonal system of equations. 

Approximate factorisation is effective with both the finite difference and finite 
element methods. The appearance of directional mass operators in the finite 
element approximate factorisation algorithm (8.39, 40) provides a means of obtain- 
ing a spatially more accurate scheme; that is, by choosing ó—45 in (8.44). The 
higher accuracy is achieved with both Dirichlet (Sect. 8.3) and Neumann (Sect. 8.4) 
boundary conditions. 

However, to maintain second-order temporal accuracy special attention must 
be given to the implementation of the boundary conditions for the intermediate 
solution correction 47%, that arises with the approximate factorisation con- 
struction. Although the finite difference and finite element methods handle 
Neumann boundary conditions in conceptually different ways, the form of the 
discretised equations are often structurally equivalent. 


The splitting (approximate factorisation) techniques developed in this 


chapter are applicable, with minor modification, to the two-dimensional transport 
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equation (Sect. 9.5), the two-dimensional Burgers’ equations (Sect. 10.4) and the 
equations governing various classes of fluid flow, particularly when the Navier- 
Stokes equations are to be solved, e.g. Sects. 17.2.1, 17.3.3, 18.3 and 18.4. 


8.7 Problems 


Two-Dimensional Diffusion Equation (Sect. 8.1) 


81 Apply the von Neumann stability analysis to (8.7) and show that the scheme is 
stable if 0 <s £0.5. 

8.2 For Ах= Лу deduce what truncation error (8.7) has if s= 1/6. 

8.3 Modify program TWDIF to implement (8.7) and verify the theoretical 
results of Problems 8.1 and 8.2. 


Multidimensional Splitting Methods (Sect. 8.2) 


8.4 Apply the von Neumann stability analysis to a three-dimensional equivalent 
of the ADI scheme (8.14 and 15) and the approximate factorisation scheme 
(8.23 and 24). Show that the three-dimensional ADI scheme is conditionally 
stable but that the approximate factorisation scheme is unconditionally stable. 

8.5 Modify program TWDIF to implement the ADI scheme (8.14 and 15) and 
compare the accuracy and economy with that of the approximate factorisation 
scheme ME=1, y=0, В=0.5. 

8.6 Repeat the comparison shown in Table 8.2 for s,=s,=0.5 and 1.5, and 
explain the results. For s,=s,=0.5 compare the results with the scheme 
developed in Problem 8.3. 


Splitting Schemes and the Finite Element Method (Sect. 8.3) 


87 Modify the program developed in Problem 8.5 to implement ADI-FEM (8.35, 
36) and compare the accuracy with that of the solutions presented in Table 8.2. 
88 Show that the scheme (8.43, 44) with у=0, В —0.5 is equivalent to the scheme 


[1—(05«4t—5Ax?) L,,] Tf ,—[1--(05a4t-04y7) Ly]; > 
[1—(0.5x4t—64y?) Г] T; ! [1c (0.5n4t--0Ax7?) L] TT, . 
For the special case д = 1/12, this scheme coincides with the spatially fourth- 
order scheme of Mitchell and Fairweather (1964). 
8.9 Replace (8.48) with AT fx , evaluated at t,,,,2 from (8.41) in program 
TWDIF. Compare the solution accuracy with that given in Table 82. 
Neumann Boundary Conditions (Sect. 8.4) 


8.10 Apply program TWDIF with Neumann boundary conditions to obtain 
solutions for the methods given in Table 8.3 for the cases s, — s, — 0.5 and 1.5. 
Discuss the results. 
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8.11 Modify the program developed іп Problem 8.3 to obtain the solution with 
Neumann boundary conditions comparable to the case considered in Table 
8.3 with s,=s,=0.5. 

8.12 Modify the ADI program developed in Problem 8.5 to solve the same test case 
as Table 8.3 is based on. Compare the solution accuracy with that given in 
Table 8.3. 


Method of Fractional Steps (Sect. 8.5) 


8.13 Modify the program developed in Problem 8.5 to implement (8.73 and 74) and 
test on the case considered in Table 8.2, 

8.14 Modify the program developed in Problem 8.12 to implement the Dirichlet 
boundary condition on х= 1 using (8.77) and compare the accuracy of the 
solutions with that of the conventional method of fractional steps. 


9. Linear Convection-Dominated Problems 


For most flow problems the motion of the fluid is an important determining factor 
of the overall flow behaviour. In the equations governing fluid dynamics (Chap. 11) 
the fluid motion is characterised by the velocity components u, v, w in the x, y, z 
(Cartesian coordinates) directions. In the one-dimensional x-momentum equation 


ди ди\ Op д?и 
(renatus (9.1) 


the velocity component и appears in the inertia term (du/dt+udu/dy) and the 
viscous diffusion term д(д?и/дх?). The other terms аге density (о), pressure (p) and 
(constant) viscosity (и). 

Previously, in considering the diffusion equation, we examined the behaviour of 
terms like 0^u/Ox?. Now we consider convective terms, like иди/дх, and how they 
can be best handled, computationally. 

The convective term has two separate features that must be taken into account. 
To begin with it contains first derivatives in the spatial coordinate. If a symmetric 
three-point formula is used to represent the term ди/дх, non-physical oscillations 
develop in the solution if the viscous term is small compared with the convective 
term. This behaviour is related to the dispersion-like influences (Sect. 9.2) in the 
truncation error. For steady flow this phenomenon will be examined in Sect. 9.3. 

If asymmetric algebraic formulae are used to represent ди/дх then, although 
the smoothness of the resulting solution is often improved, the accuracy of the 
representation of ди/дх is typically one order lower than the corresponding 
representation obtained with a symmetric algebraic formula spanning the same 
number of gridpoints. For low-order asymmetric formulae this can introduce terms 
in the truncation error that are of comparable magnitude to important physical 
terms being represented. 

This aspect will be considered in Sect. 9.2 for the pure convection equation, in 
бесі. 9.3 for the steady convection-diffusion equation and in Sect.9.4 for the 
transport equation. The transport equation is formed from (9.1) by linearising the 
convective term (i.e. replacing иди/дх with гди/дх), dropping the pressure gradient 
term др/дх and assuming that ғ and о are known. 

The second important feature of the convective term is that it is nonlinear in the 
dependent variable. For supersonic inviscid flow (Sect. 11.6.1) the nonlinear nature 
of the convective terms permits shock waves to appear. The nonlinear nature of the 
convective terms will be discussed, primarily in relation to Burgers' equation, in 
Chap. 10. 
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9.1 One-Dimensional Linear Convection Equation 


To examine the problems of including convection the following linear equation will 
be considered: 


—+u—=0 , (9.2) 


where и is the known velocity and T is a passive scalar, e.g. temperature. Equation 
(9.2) is hyperbolic (Sect. 2.2) in character. Equation (9.2) can be interpreted as a 
model for the convective part of the energy equation (11.44). 

If u is constant and positive then a general solution of (9.2) can be written as 


T(x, t)e F(x-ut) , (9.3) 
where the initial condition is given by 
T (x, 0)= F(x) . (9.4) 


and F(x) is known. If F(x) is specified over the complete x range, - co Ex oo, 
then the solution at some specific location (хи, t,) in the (x, t) plane is equal to the 
solution at x, —ut, at time t—0, і.е. 


T(x,—t,)=F (x, —ut,)- T(x,— ut, 0) . 


This is illustrated in Fig.9.1. The solution T is constant along lines such as AB, 
which is a characteristic for this equation (Fig. 2.5). 


—— — —À «же —À е-е —À —À —À —À е-е «ее еме «ә — — —À ө-- B(x, t) 





Т(х,0)= Р(х), Fig. 9.1. Dependence of the solution 


A(x,-ut,,0) x on the initial data 


9.1.1 FTCS Scheme 


The simplest algorithm for the diffusion equation is the FTCS scheme (7.5). The 
corresponding forward time, centred space (FTCS) finite difference representation 
for (9.2) is 
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As an algorithm (9.5) can be written 
T3**=T?-OS5C(T},,—TI-4) , (9.6) 


which is consistent with (9.2) with a truncation error of O(At, 4x’). In (9.6), С is 
called the Courant number and is defined by 


At 
=y— . 97 
С и (9.7) 


Application of the von Neumann stability analysis to (9.6) produces ап ampli- 


fication factor G for the mth component of the initial error distribution that is given 
by 


G-1-iCsinü. (9.8) 


Clearly, |G|Z 1 for all 0 so that (9.6) is unconditionally unstable. It may be recalled 
that the equivalent (FTCS) scheme was conditionally stable for the diffusion 
equation. It follows, from the lack of stability, that the FTCS scheme is of no 
practical use for pure convection problems. For completeness the properties of the 
FTCS scheme applied to (9.2) are summarised in Table 9.1. 


9.1.2 Upwind Differencing and the CFL Condition 


An alternative scheme is obtained by introducing a backward difference formula 
for 2Т/дх assuming that и is positive. Thus, (9.2) is discretised by 


T'*!-T* u(T5 — T5.) 
Ld PL РИ 9.9 
At + Ax 0. 9 ) 


which as ап algorithm, becomes 
Ti =(1-С)Т"+СТ"_, . (9.10) 


For negative и (9.10) is replaced by 7771-(1-|С|)Т; |C| Т, ,. Equation (9.10) 
indicates that the solution T7*!, is determined by information upwind of node 
(j, n). Hence, (9.10) will be referred to as an upwind scheme (Table 9.1). Numerical 
solutions using the upwind scheme are provided in Sect. 9.1.5. 

The von Neumann stability analysis applied to (9.10) gives the amplification 
factor shown in Table 9.1. Stable solutions are obtained if 


At 
C=u— <1. 9.11 
Way! ( ) 


The inequality C<1 is referred to as the Courant-Friedrichs-Lewy (CFL) 
condition. This condition applies, generally, to explicit schemes for hyperbolic 
partial differential equations. Physically, the CFL condition indicates that a 
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particle of fluid should not travel more than one spatial step-size 4x in one time- 
step At. 

For the particular case C=1, (9.10) gives T7*'=T7T4_,, which is the exact 
solution of (9.2); this can be seen from Fig. 9.1. 

A Taylor series expansion, about node ( j, n), of the exact solution substituted 
into (9.10) produces the result 


oT oT oT 
at ar "ax OM 2 


д? Т 
КЛЕТ Ах2-0. (9.12) 
Thus, (9.10) is consistent with (9.2) but has а truncation error, whose leading terms 
аге O(At, Ax). From (9.2) 


oT — oT d OT. SUTSO 
ôt ^ дх at? И 9х? 


so that (9.12) сап be written as 


ӨТ ull osui O55 қан, 4х?)=0. 


If (9.10) is interpreted as having a truncation error of O(At?, 4x7), (9.10) is seen 
to be consistent with 

oT | OT „0 Т 

зх xt 043) 
instead of (9.2). Therefore, the use of the two-term, upwind finite difference 
representation for 2Т/дх, combined with the forward difference formula for 27/01, 
is seen to introduce an artificial (numerical) diffusivity «’ = 0.5u Ax(1 — C). Clearly, 
the artificial diffusivity is zero when С =1; this is not surprising since the solution 
given by (9.10) then coincides with the exact solution. 

It might appear that setting At to give С = 1 would avoid the artificial diffusivity 
problem. Although this is true for the linear convection equation, it is not possible 
to ensure C=1 everywhere for nonlinear equations, like Burgers’ equation 
(Sect. 10.1), where и (in C) is spatially varying. 


9.1.3 Leapfrog and Lax-Wendroff Schemes 


Following the same approach as for the diffusion equation we seek a more accurate 
representation of (9.2) by introducing centred differences in both time and space. 
This gives the leapfrog method 


T_T) ште TAa) 
1; —1j Uintia) 9.14 
2447 % 24x 0, (914) 
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which as an algorithm becomes 
ЕТ и СТ". T31). (9.15) 


Equation (9.15) is consistent with (9.2) with a truncation error of О (412, Ax?). The 
application of the von Neumann stability analysis to (9.15) produces the amplifi- 
cation factor С shown in Table 9.1. It can be seen that |G| 2 1.0 if C<1 but |G|>1.0 
for some values of 0 if C» 1. Thus, if the CFL condition is satisfied, the leapfrog 
Scheme 15 neutrally stable. This is desirable since it implies no damping of the 
solution as time advances; which is what the physical process of convection (9.2) 
also implies. 

It can be seen that the solution 77*! given by (9.15) leaps over the solution at 
Т"; hence, the descriptive name: leapfrog method. However, the structure of (9.15) 
implies the grid is treated like a chessboard; the solution on black 'squares' is 
completely independent of the solution on red ‘squares’. This can lead to two ‘split’ 
solutions. Roache (1972, p. 60) discusses strategies, such as averaging the solution 
at time-levels n and n+1, to overcome this problem. 

Haltiner and Williams (1980, p. 111) have analysed the leapfrog scheme (9.15) 
applied to a problem with a harmonic initia] condition, ie. (9.4) becomes 
Ғ(х)-ехр(1/т Ах) where m is the wave number (Sect. 9.2). Haltiner and Williams 
obtain an analytic solution of (9.15) which consists of two waves or modes 
associated with the fact that three time-levels are linked by (9.15). One mode 
corresponds to the physical solution and the other is a spurious computational 
mode which propagates in the opposite direction to the physical solution and 
whose magnitude alternates in sign at each time step. 

For the linear convection equation, the computational mode can be suppressed 
by an appropriate choice of the additional level of initial conditions, i.e. Т"! as 
well as T". However, for nonlinear problems the "computational" mode, inherent 
in the leapfrog scheme, may develop and aggravate the separation of the solutions 
at consecutive time steps referred to above. Strategies such as averaging essentially 
suppress the “computational mode" with little effect on the physical mode. ` 

Being a three-level scheme the leapfrog method requires the use of ап 
alternative method for the first time-step. The leapfrog method is recommended 
(Morton, 1971) for hyperbolic problems like the convection equation (9.2); this may 
be contrasted with the unstable nature of the (equivalent) Richardson scheme for 
the diffusion equation. 

The Lax-Wendroff method has been a very effective (and popular) algorithm for 
solving the equations that govern inviscid, compressible flow (Sects. 10.2 and 
11.6.1). For the convection equation (9.2), the Lax- Wendroff scheme coincides with 
Leith's method (Roache 1972, p. 77). These methods are based on constructing a 
second-order representation for the term 27/2: from a Taylor series expansion. 
Thus, 


oT Т.Т" oT 

—— xm —0.54t— 

ді At д? 
T^*1- п 2 
=—/ Т) 0.5At ,9 T 
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Introducing the centred finite difference expression for 02 Т/дх2 gives the 
Lax-Wendroff representation of (9.2) as 


T'* = T" -05C(T*, , — T* ,)405C(T* ,-2T*  T7,.4). (9.16) 


The Lax-Wendroff scheme is consistent with (9.2) with a truncation error of O(At?, 
Ах?) and is stable if the CFL condition C x 1.0 is satisfied (Table 9.1). Numerical 
solutions using the Lax-Wendroff scheme are provided in Sect. 9.1.5. 


9.1.4 Crank-Nicolson Schemes 


The Crank-Nicolson scheme is very effective when applied to the one-dimensional 
diffusion equation. In this section, Crank-Nicolson finite difference and finite 
element schemes are applied to the one-dimensional convection equation (9.2). 
The Crank-Nicolson finite difference scheme can be written 
nal п 
Т" + — Т 


4i 2 +u(0.5L,T%+05L,T%*')=0 , (9.17) 


where L, 7;=(T;,,—7;-~1)/24x. Equation (9.17) produces the following tridiago- 
nal algorithm: 


—025CT**i-- T5! -025CT7112025CT7.,- T7-025CT7,, , (918) 
which can be solved efficiently using the Thomas algorithm (Sect. 6.22). The 
Crank-Nicolson scheme (9.17) is consistent with (9.2) and has a truncation error of 
O(At?, Ax?). A von Neumann stability analysis produces the amplification factor 
С shown in Table 9.1. It is clear that the Crank-Nicolson scheme is unconditionally 
stable. 

The application of the Galerkin finite element method with linear interpolation 
in Space to (9.2) produces the following system of ordinary differential equations: 


Mr | +uL,T;=0 , (9.19) 
dt |, 


where the mass operator M, = (1, 4, 4}. Introducing Crank-Nicolson differencing 
for d T/dt gives the scheme 


ТЇ _ Т" 
m| EL +и[0.5(Т/+Т/*!)]=0, (9.20) 


which may be compared with (9.17). The mass operator can be generalised, as in 
(8.44) to 


M,={6, 1-25, б}, (9.21) 


where б =} produces the finite element form and 6 =0 produces the finite difference 
form. Substituting (9.21) into (9.20) produces the tridiagonal algorithm 
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(6-0.25C) Т"*1+(1-—26)Т"*1+(5+0.25С)Т"#! 


=(5+0.25C)T%_, +(1—-26)T%+(6-0.25C)T%,, . (9.22) 


This algorithm is executed in program TRAN (Fig. 9.8). 

The finite element form (9.20) has a truncation error of O(At?, 4x*). Thus, for 
sufficiently small time-steps that the solution error is dominated by the spatial 
discretisation, the Crank-Nicolson finite element method produces more accurate 
results than the Crank-Nicolson finite difference method. The second-order trunc- 
ation error shown in Table 9.1 comes from the modified equation treatment 
(Sect. 9.2.2) of the 4t* term. For C and u of O(1), this is the dominant term in the 
truncation error. The Crank-Nicolson finite element scheme (9.20) is uncon- 
ditionally stable (Table 9.1). 

Typical implicit schemes, considered for the diffusion equation, are un- 
conditionally stable when applied to (9.2), as indicated in Table 9.1. However, the 
use of implicit schemes for hyperbolic equations is not necessarily advantageous. 
The use of an implicit scheme leads to a system of coupled equations at a given 
time-level n+ 1. Therefore, a perturbation, e.g. due to round-off, introduced at опе 
node (j, n) affects the solution at all other nodes (j, n+ 1) at the next time-level. 
Physically, this behaviour corresponds to a parabolic partial differential equation 
(Sect. 2.3.2) like the diffusion equation. 

In contrast, disturbances to solutions governed by hyperbolic partial dif- 
ferential equations (Sect. 2.2.2) are propagated at finite speeds. Using implicit 
Schemes with hyperbolic partial differential equations typically produces inac- 
curate solutions if ДЕ (or equivalently C) is large. If At is small, i.e. C < 1.0 for the 
convection equation, there is no stability advantage in using implicit methods, 
although there may be an accuracy advantage, as with the Crank-Nicolson finite 
element scheme. 


9.1.5 Linear Convection of a Truncated Sine Wave 


To illustrate the behaviour of the various schemes for obtaining solutions to (9.2) 
we consider the propagation of a sine wave. Thus, (9.2) is to be solved subject to the 
initial condition 


T (x, 0) =зіп(10лх) Іог0<х<01, (9.23) 
=0 for 0.1 <х< 1.0, 
with boundary conditions 
Т(0, )=0 and T(1:1)20 
The exact solution, up to t=0.9/u, is 
T=0 0<х<ш, 
-білГ10лх-ш)| ш<х<ш-01, (9.24) 
=0 #-+0.1<х<1.0. 
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For a value u=0.1 the exact solutions at t=0 and 8 are shown in Fig. 9.2. The sine 
wave propagates with no reduction in amplitude at a speed u=0.1. 

The various schemes described іп Sects. 9.1.2-4 are implemented in program 
TRAN (Fig. 9.8), with «=0. Program TRAN computes solutions to the one- 
dimensional transport equation and is described in Sect. 9.4.3. For the present 
problem the exact solution (9.24) is evaluated in subroutine EXSOL (Fig. 9.9). 

Computational solutions have been obtained with 41 equally spaced points in 
the interval 0€ x<1.0 and with a Courant number C=0.8. The output from 
program TRAN at t=8.0 (i.e. after 40 time-steps) for the upwind scheme (9.10) is 
shown in Fig. 9.2. The computational solution is smooth, but compared with the 
exact solution, appears to have ‘melted’ (or diffused) away; this is consistent with 
the introduction of artificial viscous terms, as in (9.13). 

The solution for the Lax-Wendroff scheme (9.16) is shown in Fig. 9.3 for values 
t=8.0 and С=0.8. It may be recalled that the structure of the Lax-Wendroff 
scheme is equivalent to the FTCS scheme (which is unstable) with additional 
diffusive terms added, where 


add = 0.5 Cud x. 





0.4 
8.2 1 
| 
1 
8.2 0.4 6.4 
Fig. 9.2. Upwind solution for the convection Fig. 9.3. Lax-Wendroff solution for the con- 
equation with C=0.8 vection equation with C=0.8 


The Lax-Wendroff solution is characterised by a primary wave of reduced ampli- 
tude travelling more slowly than the exact solution and the appearance of ancillary 
waves in the ‘wake’ of the primary wave. If the initial condition (9.23) were given a 
Fourier representation and the individual components (modes) propagated and 
reconstructed to generate the Lax-Wendroff solution at t = 8.0, the ‘wake’ would be 
consistent with the short wavelength components travelling more slowly than the 
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dominant component. The trailing wave pattern is referred to as a ‘dispersion’ 
wake. Dispersion will be discussed in Sect. 9.2. 

It may be noted that the upwind and Lax-Wendroff schemes reproduce the 
exact solution if C=1.0, But the implicit schemes, such as the finite difference 
Crank-Nicolson scheme, do not. The Crank-Nicolson finite difference solution at 
t=8.0 with C=0.8 is shown in Fig.9.4. The solution indicates a primary wave 
which is travelling more slowly than the exact solution and a substantial dispersion 


wake. 
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The corresponding Crank-Nicolson finite element solution is slightly more 
accurate than the solution shown in Fig. 9.4; the maximum amplitude is larger and 
the dispersion wake is smaller. However, at C=0.8 and u=0.1 the errors are 
dominated by the 4t? terms. For С=0.1 and larger values of u the fourth-order 
spatial accuracy becomes more important and the Crank-Nicoison finite element 
scheme is considerably more accurate than the other schemes. 

The flow patterns shown in Figs. 9.2-4 will be reconsidered at the end of 
Sect. 9.2. It should be emphasised that a relatively coarse mesh (4x, At) and а 
relatively ‘difficult’ initial condition have been used to highlight some of the 
problems in computing convection-dominated flows. However, it is still true that 
obtaining computational solutions to the (pure) diffusion equation is considerably 
easier than when convection is present. 


9.2 Numerical Dissipation and Dispersion 


In fluid dynamics many phenomena are governed by systems of equations that are 
“hyperbolic” in character; they contain little or no dissipation. Solutions are 
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characterised by wave-trains that propagate with little or no loss of amplitude. It is 
important that numerical schemes do not introduce non-physical dissipation, for 
example as in (9.10) and Fig. 9.2. It is also important that the numerical schemes do 
not alter the speed at which waves propagate; that is, the numerical schemes shouid 
not introduce artificial dispersion, producing the result shown in Fig. 9.4. 

The solution for the propagation of a plane wave that is subjected to both 
dissipation and dispersion can be written as 


T= RIT amp е” Рт! еітіх – 40) , (9.25) 


where Т.р is real and positive, m is the wave-number which is related to the 
spatial wavelength A by A=2z/m. In (9.25), p(m) determines how rapidly the 
amplitude of the wave attenuates and q(m) is the propagation speed of the wave. 

If the motion of a plane wave represented by (9.25) 1$ governed by the linear 
convection equation (9.2), p(m) and q(m) take the values 


р(т)=0 and q(m)-u, 


ie. waves of any wavelength are propagated at the same speed with no damping. 
It is instructive to consider two equations that are closely related to the 
convection equation (9.2); these are 


ат oT T 


ЕП из; 3370 апа (9.26) 
oT OT ӘТ 
or tax Pa . (9.27) 


Equation (9.26) is the transport equation which will be considered in Sect. 9.4. 
Equation (9.27) is the linear Korteweg de Vries equation. For plane waves 
governed by (9.26), the solution (9.25) has the parameter values 


and q(m)=u , (9.28) 


р(т)-ат? 
ie. the amplitude of the wave is attenuated by the diffusive term but the wave 
propagation speed is unaffected. Since т--2л/4, short wavelengths are attenuated 
much more rapidly than long wavelengths. For plane waves governed by (9.27) 


р(т)=0 and q(m)2.u—-fm?, . (9.29) 


ie. the amplitude of the wave 1s unaltered but the wave propagates at a speed 
depending on its wavelength. If waves of more than one wavelength are present 
they propagate at different speeds, ie. they disperse. The change in wave 
propagation speed is more pronounced for short wavelengths (large m). Thus, if В is 
positive short wavelengths travel much more siowly. 

We can now formally define dissipation as the attenuation of the amplitude of 
plane waves and dispersion as the propagation of plane waves of different wave- 
number m at different speeds q(m). Through the role of the higher-order derivatives 
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in (9.26 and 27) we can also associate positive dissipation with the appearance of 
even-ordered spatial derivatives multiplied by coefficients of alternating sign. 
Similarly, a reduction in wave propagation speed can be associated with the 
appearance of odd-ordered spatial derivatives multiplied by coefficients of alter- 
nating sign. 

In examining the consistency (Sect. 4.2) of the discretised equation, e.g. (9.9), it is 
apparent that the computational algorithm, e.g. (9.10), is equivalent to the original 
partial differential equation plus the truncation error, on a finite grid. But the 
truncation error consists, typically, of successive higher even- and odd-ordered 
derivatives. The link between dissipation, dispersion and higher-order derivatives 
in the truncation error will be pursued in Sect. 9.2.2. 


9.2.1 Fourier Analysis 


Quantitative information can be obtained about the numerical dissipation and 
dispersion introduced by a particular computational algorithm by comparing 
Fourier representations of the exact and numerical solutions. Thus, for the specific 
example considered in Sect. 9.1.5, a Fourier representation of the initial condition 
(9.23) would be 


Т(,0- У Т.е". 
Using the separation of variables approach (Sect. 2.5.2) the exact solution at any 
intermediate time is 


o0 
Т(х)= Y T,emewm, (9.30) 
тш--ш 

which will reconstruct the exact solution (9.24). The Fourier components іп (9.30) 
all convect with the same velocity и and аге not subject to any reduction in 
amplitude since there is no diffusive term in (9.2). 

In a parallel manner a Fourier representation can be introduced for the 
solution of the computational algorithm, e.g. the upwind scheme (9.10), in the form 


T (x, t)= Y T, e^ Fm eimix ^am] А (9.31) 
т= —o 
and since the algebraic equation of interest (9.10) is linear the components of the 
Fourier representation can be considered independently. 

During one time-step At the mth component of the approximate solution will 
have decreased in amplitude by exp[ — p(m)4t] and will have advanced a distance 
q(m) At. During the same time-step the mth component of the exact solution will 
have maintained constant amplitude while advancing a distance иде. If the 
computational algorithm is to introduce no artificial dissipation or dispersion, p(m) 
must be zero and q(m) must equal и. 
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Expressions for р(т) and q(m) can be obtained by substituting (9.31) into (9.10) 
and constructing the amplitude ratio of the mth component of the Fourier 
representation for a single time-step, i.e. 


T.e —р(т)( At) gimix- aim) + AOL 
m 


бһ- Т e Pim elmix—a(m)il 
m 


шетірітугіт (mm) at | (9.32) 
But from the substitution into (9.10), the amplitude ratio is 


G,,= 1+ C[cos(mAx)—1]—iCsin(mAx) , (9.33) 


which, not surprisingly, is precisely the same expression (see Table 9.1) as is 
obtained in assessing the von Neumann stability of the upwind scheme (9.10). 
From (9,32, 33), 


|G],,=e 7° 4 = 1—4C(1—C)sin2(0.5 т/х). (9.34) 


For a stable solution p(m)20, so instability may be associated with negative 
dissipation. But (9.34) also gives a measure of how dissipative the particular 
algorithm (9.10) will be. The dissipation parameter p(m) depends both on C and on 
the wave-number parameter mAx. Attenuation is largest for mAx —, ie. for the 
shortest wavelengths. 

The wave propagation speed q(m) is related to the phase ф„ of the amplification 
ratio G,,. From (9.32, 33) 


ф„= — та(т) At 


- Csin(mA x) 
1+ C[cos(n4x) – 1] ' 


-tan^! (9.35) 


фт is the change of phase experienced by the mth component during one time-step 
At. For the exact solution, 


Qa = —mu4dt- —CmAx . 





Thus, 
bm am) ( -1 uan. тота) _ 
НЫ СТ” ‘an а). (9.36) 


The propagation speed (or equivalently, the phase change) is accurate for small 
values of mAx but the error grows as mAx->n. 


If 0« C «05, а(т)<и for mAx large. 
If 05« C«10, q(m-u Юг mAx large. 
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Consideration of the different computational algorithms in Sect. 9.1 leads to 
different expressions on the right hand side of (9.33). In turn different expressions 
for p(m) and q(m) are obtained, through (9.32). 

By introducing a Fourier representation for both the exact and approximate 
solutions, and considering the amplification ratio of the mth component, it is 
possible to obtain expressions for the dissipation and dispersion in the 
numerical scheme for any value оҒт Ах. Morton (1971, pp. 244—245) has used this 
approach for the numerical schemes considered in Sect. 9.1. 

The shortest wavelength that a discrete grid can represent is A=2Ax. This 
corresponds to т4х-л. Long wavelengths correspond to т4х->0. If the 
numerical scheme introduces any dissipation or dispersion it is expected, from 
(9.26-29), that short wavelengths will be much more seriously affected than long 
wavelengths. From a practical viewpoint, it is preferable to model the long 
wavelength behaviour (small m Ax) accurately and to check that any spurious short 
wavelength behaviour is physically or numerically attenuated. This strategy 15 also 
consistent with the analysis of Sect. 3.4.2. 


9.2.2 Modified Equation Approach 


We have seen that dissipation and dispersion can be associated with the occurrence 
of second and third spatial derivatives, as in (9.26 and 27). This suggests that an 
examination of the truncation error may also permit the dissipative and dispersive 
tendencies of a particular computational algorithm to be identified. However, the 
truncation error must be interpreted in a special way. 

It is assumed that L(7)=0 represents the differential equation and L,(7)=0 
represents the algebraic equation, e.g. (9.9). Therefore, the previous development of 
the truncation error Ej can be written 


L(T)-L,(T)- EX(T)-0 . (9.37) 
Now, the following expression for the truncation error must be obtained: 
L,(T)-L(T)- E(T)20 , (9.38) 


where the algebraic equation has been expanded as a Taylor series in the ap- 
proximate solution. Equation (9.38) is manipulated so that E(T) contains only 
spatial derivatives. For this purpose (9.38) 15 repeatedly differentiated to eliminate 
pure and cross derivatives in t. This is in contrast to the situation for (9.37) where 
L(T)=0 may be used to simplify the truncation error. 

The equation L(7)+ E(7)=0 is a differential equation with an infinite number 
of terms. This is called the modified equation (Warming and Hyett 1974) and 1$ the 
differential equation that the algebraic equation would solve if sufficient boundary 
conditions were available. The modified equation can be used to demonstrate 
consistency and the order of accuracy of the computational algorithm, as before. In 
the modified equation odd-order derivatives are associated with dispersion and 
even-ordered derivatives are associated with dissipation. Therefore, by considering 
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the lowest odd- and even-order terms in the modified equation we can deduce the 
dissipative and dispersive properties of the algebraic scheme, but only for long 
wavelengths. This is because the modified equation is based on a Taylor expansion 
that assumes small 4t and 4x. However, as noted above, it is the long wavelength 
behaviour that is generally of greater interest. 

Using the modified equation approach it is possible to show that the Lax- 
Wendroff scheme (9.16) applied to (9.2) has as the leading terms in the truncation 
error 


Ax? OT (Ax? 2047 


As noted previously, the Lax-Wendroff scheme is consistent with (9.2) to 
O(At?, Ax?). The above truncation error indicates second-order dispersion and 
third-order dissipation. The modified equation approach has been used to generate 
the truncation errors shown in Tables 7.1, 9.1 and 9.3, so that the numerical 
dissipation and dispersion associated with long wavelengths (small тх) can be 
assessed. 

The modified equation analysis has the advantage that it extends to nonlinear 
equations, whereas Fourier analysis (Sect.9.2.1) is only applicable to linear 
equations. Klopfer and McRae (1983) use a modified equation analysis to con- 
struct generalised Lax-Wendroff schemes for one-dimensional unsteady propa- 
gation of shocks, which are governed by the Euler equations (10.40). 

For the equations governing fluid dynamics (Chap. 11) the analysis of compu- 
tational algorithms using the modified equation approach requires a substan- 
tial amount of algebraic manipulation. Warming and Hyett recommend a sys- 
tematic tabulation to keep track of individual terms. It is expected that symbolic 
manipulation using a computer, e.g. ма MACSYMA, would be very effective in this 
area. The possibilities for symbolic manipulation in computational fluid dynamics 
are discussed by Steinberg and Roache (1985). 


9.2.3 Further Discussion 


А consideration of the truncation errors shown in Table 9.1 for the upwind, Lax- 
Wendroff and Crank-Nicolson schemes applied to the example at the end of 
Sect. 9.1 is facilitated if attention is focussed on the Fourier components that make 
up the solution. The results shown in Figs. 9.2-4 and Table 9.1 indicate that 
excessive numerical dissipation effectively obliterates the small dispersion errors 
for the upwind scheme and attenuates some of the dispersion errors in the Lax- 
Wendroff solution. The Crank-Nicolson schemes are dominated by dispersion 
errors which act to significantly slow down the propagation speed of individual 
Fourier components. Although the Crank-Nicolson schemes are stable for 
increasing Courant number the dispersion errors grow like C? (Table 9.1). For 
small C, and hence small At, the dispersion error of the Crank-Nicolson finite 
element scheme (Table 9.1) is considerably smaller than that of the other schemes 
considered in Sect. 9.1. 
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The use of Fourier analysis and the modified equation analysis permits the 
construction of more accurate schemes designed to reduce dissipation and dis- 
persion errors. This can be illustrated with the Taylor-Galerkin (Donea 1984) finite 
element method. In this method (9.2) is first discretised in time to give 

T""—-T" T 


apo (9.39) 


However, a more accurate representation of 07/0t at time level n can be written 








ӘТ _Т"*1—г" Т At? дзТ| 

—) == 0 54t— | –0(413). 

дї At 0541552 6 дг? (Ағ) 
From (9.2), 

GT 202Т апа OT 2 д? oT 

д" ду? д“ dx? at ` 


Therefore а more accurate [than (9.39)] semi-discretisation of (9.2) is given Бу 


и? At? 92 ТТТ" oT QT 
- —[——————4u—--—05u^ A4t—.-20. 9 40 
| 6 24 At Tux н EE (940) 





The application of а uniform-grid, linear interpolation Galerkin finite element 
formulation (Sect. 5.4) to (9.40) gives 


С: 
|м.- = айы rr рТ") +САхІ,Т"—0.5 С?Ах?І,,Т'=0, (941) 


which is а generalisation of the Lax-Wendroff scheme (9.16). Тһе mass operator 
M,={@, 3,8, arises from the finite element formulation. The term (C2/6)L,., 
(Т1*®— Т") appears due to retaining additional terms in the Taylor series 
expansion of 27/21. Equation (9.41) has a truncation error of 0(443, 4х3), which 
may be contrasted with the second-order truncation error of the Lax-Wendroff 
scheme (9.16). Donea (1984) has examined the dissipation, dispersion behaviour via 
a Fourier analysis (Sect. 9.2.1) and finds that (9.41) is superior to the Lax-Wendroff 
scheme for all wavelengths and for Courant numbers in the interval 0<С < 1. 

The above strategy can be generalised by adding in controlled (anti-)dissipation 
and (anti-)dispersion as part of the Taylor series construction. Baker and Kim 
(1987) develop such a formulation and call it the Taylor- Weak statement (T WS) of 
the Galerkin finite element formulation. The TWS algorithm has four disposable 
constants. Specific choices of these parameters reproduce 17 well-known finite 
difference and finite element methods. An optimum choice can be made, in 
principle, for the linear convection equation (9.10) However, for more general 
nonlinear hyperbolic conservation equations, e.g. the Euler equations (10.40), an 
optimum choice will be solution dependent, aithough desirable dissipation, 
dispersion attributes can still be obtained in a gross sense. 
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We may infer the general guideline that first-order formulae for derivatives 
should be avoided. A first-order representation for the nth derivative in the 
governing equation will generate spatial derivatives of order (n+ 1) and higher in 
the modified equation (which is the equivalent governing equation actually solved). 
This is particularly serious for the convective terms (п 1), since the introduction of 
spurious second or third derivatives can change the character of the solution 
significantly. 


9.3 Steady Convection-Diffusion Equation 


For many flow problems dissipative mechanisms are only significant in a narrow 
layer, typically adjacent to a boundary. Computational solutions obtained with 
grids appropriate to the main flow region are often oscillatory when the true 
solution changes rapidly across the boundary layer. The steady convection- 
diffusion equation is a useful model equation with which to illustrate the 
phenomenon. 

This equation can be written in one dimension as 


t a —-0. | 
“2-4 (9.42) 


И represents the steady-state balance between convection and diffusion; time plays 
no role at all. If (9.42) is combined with boundary conditions 


T(0-0 and T(D-10 (9.43) 
the exact solution is obtained in the interval 0E x € 1 as 
T— [e'** — 1.0]/[e"* — 1.0] , (9.44) 


which is shown in Fig. 9.5. The solution is characterised by a uniform distribution 
of T combined with a boundary layer adjacent to х= 1.0. 


1:0 m Ах=@ .2, R 124 
o Аха0.1, MEE | 
І А Ахаб.й5,К у= ! 
--- exact, i 
ч/«ж28 H 
0:5 ! 


d 
Ою-А-о-А-Ю-А-О-А- лево. о-А-О 


Fig. 9.5. Cell Reynolds number influ- 
ence on the solution of the convection 
diffusion equation 
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9.3.1 Cell Reynolds Number Effects 


If centred three-point finite difference expressions are substituted for the derivatives 
in (9.42), the following algorithm is obtained: 


-(1 +0.5 К.) T;- 1 T2T;-(1 — 0.5 К) Т). 1 =0 А (9.45) 


where К. п=и4х/а, ie. К, is a Reynolds (strictly Peclet) number based оп the 
characteristic length 4x. 

A Taylor series expansion of (9.45) about the jth node indicates that it is 
consistent with (9.42) with a truncation error of О(4х2), Equation (9.45) is implicit 
but when equations are obtained for all nodes a tridiagonal system of equations is 
obtained which сап be solved economically (Sect. 6.2.2). 

For this relatively simple case, it is also possible to write an exact solution of 
(9.45) as 


(9.46) 


1+0.5 В j 
Tym AB 10329} 


1 — 0.5 Ree 


where A, and B, are chosen to satisfy the boundary conditions (9.43). For the case 
ија = 20 and Ax = 0.05, 0.1 and 02, solutions to (9.46) are shown in Fig. 9.5. The 
solution for 4x —0.05, which is equivalent to К, — 1, is in reasonable agreement 
with the exact solution (9.44). The solution with the intermediate grid, К. — 2, is 
accurate except іп the boundary layer adjacent to x = 1. However, the solution on 
the coarsest grid, К —4, is not only inaccurate but also oscillatory. 

From (9.46) it can be seen that oscillatory solutions are avoided if К, x 2. It is 
also instructive to relate the solution behaviour shown in Fig. 9.5 to the eigenvalues 
of the matrix of equations (9.45) when combined with boundary conditions (9.43). 
This system has the form 


AT-B, (9.472) 
i.e, 
b Č Т, — aT, 
abe T; 0 
= у 5 (9.47b) 
abe Ту 0 
а b Т,- 1 = cT; 


where а= — (1 +0.5 Б.) b=2 and c2 – (1—0.5 К). The eigenvalues of the 
above tridiagonal matrix are related to a, b and c as 

Àj b ао)“ cos( 7 j= 

j= + (ac) cos ісі , j=1, 23,...,4—2 . (9.48) 
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It can be seen that the condition ac=0 is required for the eigenvalues to be real. 
Substituting for a and c gives the condition 


(1 + 0.5 Re) — 0.5 Коеп) > 0 or К.п = 2 к (9.49) 


Thus, for this example, the oscillatory solutions coincide with the occurrence of 
complex eigenvalues. 

For A, in (9.47a) of general form, bounds on the eigenvalues 4 are given by the 
Gershgorin circle theorem (Jennings 1977, p. 35) 


N 
|A — aul £ У lai] . 
j*i 

That is, the eigenvalues of A lie in the union of circles associated with each row 
of A. The centre of each circle is given by the diagonal entry а, and the radius by 
the sum of the magnitudes of the off-diagonal elements in each row. Carey and 
Sepehrnoori (1980) use the Gershgorin circle theorem to analyse (9.47a) for 
different values of R,.,,. In the range 0< К. € 2 the centre of the Gershgorin circle 
les at {—2а/4х?, 0) in the complex plane with radius 2a/4x?, and all 
eigenvalues Пе on the negative real axis. As R,, increases, i.e. as а decreases, the 
radius reduces and the centre of the circle moves closer to the origin. At R,.,= 2 the 
centre is at (C u/Ax, 0) with radius u/Ax =2a/Ax?. For К > 2 the radius remains 
constant at u/Ax but the centre migrates to the origin and the eigenvalues are 
complex. When К, = оо, the circle centre is at the origin and the eigenvalues are 
purely imaginary. . 

The limitation R.e £ B for non-oscillatory solutions, applies to other methods, 
such as the finite element method. Finiayson (1980, p. 241) provides values of B for 
finite element and orthogonal collocation methods. However, B is typically less 
than 10. 

It should be emphasised that oscillatory solutions may occur if К, > B. 
Whether they actually do occur also depends on the flow geometry and imposition 
of boundary conditions. For example, decelerating flow ahead of an obstruction 
often produces an oscillatory solution if R,,,7 B. However, oscillation-free 
solutions for uniform or accelerating flow can be obtained even though К> B. 

As might be expected, the replacement of the centred finite difference represen- 
tation for 07/0х іп (9.42) with the upwind finite difference. expression 
(T;,— T, 1)/Ах prevents the oscillations from occurring. The upwind scheme 


produces the following algorithm in place of (9.45): 
—(1 + Ren) Tj- 1 + 2(1 + 0.5 Roen) Т;-- T 54 17 0 . (9.50) 


Consideration of (9.48) shows that the upwind scheme has real eigenvalues for all 
values of Кс. The solution for the upwind scheme with u/a=20 and 4x —0.2 is 


Shown in Fig. 9.6. As expected, the solution is not oscillatory but is not very 


accurate either. 
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РО . Fig. 9.6. Upwind solution of the con 
o Ax-8.2, R- =4, (upwind) | vection diffusion equation 
T a Ax*8.2,R... 1 1^4, (center) ! 
--- exact, | 





0:5 1-0 


А Taylor series expansion of (9.50) indicates that it is consistent with (9.42) to 
O(Ax) only. If (9.50) is treated as being accurate to O(Ax?) it is found to be 
consistent with the equation 


dT d*T 
uq. «d +0.5 К еп) 2170 , (9.51) 
i.e. the use of the first-order upwind difference formula has introduced ап artificial 
diffusivity 0.5R, ua. This may be compared with the artificial diffusion introduced 
by the upwind scheme for the convection equation, see (9.13). For accurate 
solutions a consideration of (9.51) would suggest the practical limit 


0.5К 41, 


which is more restrictive than (9.49). 

If a Neumann boundary condition is introduced in place of the Dirichlet 
boundary condition at x=1.0 the character of the solution is not radically 
altered. For large values of и/х the centred-difference scheme (9.45) produces 
solutions with even greater oscillations than those in Fig. 9.6. 


9.3.2 Higher-Order Upwind Scheme 


The very oscillatory behaviour of the three-point centred difference representation 
for 4Т/4х in (9.42) and the very dissipative nature of the two-point upwind scheme 
(9.50) suggests that a four-point representation of d T/dx will be necessary to obtain 
satisfactory results. For и positive the following representation would be 
appropriate: 


dT 
dx 812-2 tbT;- +СТ;+ Т) . (9.52) 
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Following the general procedure of Sect. 3.2.2 the following scheme is obtained: 


dT Т,1- і-і а(Т,-2-ЗТ,-,+3Т,-Т;+ 1) 2 
dx 24x бозы mf, (9.53) 


For и negative (9.53) is replaced by 


dT Т,,1-Т,- 4(1,-1-37;%3Т;,1-Т,,;) 2 
dx ^ 24x 34x tolx) 


Equation (9.53) has been deliberately written as a modification to the three-point 
centred finite difference representation. The parameter 4 controls the size of the 
a алмаи 
modification. 
A Taylor series expansion about node j indicates that 
dT d*T | 


qx «Т 054 xta 


T;-,—3Tj;.,43T;— паз 4% 
That is, the modification can be used to counteract specific terms іп the Taylor 
series expansion for the complete equation. In particular, the choice 4=0.5 
eliminates the 4x^d? T/dx? term and makes (9.53) of О(Ах2). Based on the 
association of wiggles with dispersion and with odd-ordered derivatives (Sect. 9.2) 
it can be anticipated pat 9 53) with q—0.5.combined with the previous 

iscretisation 0 x produce a more accurate solution, with fewer 


wiggles, than (9.45). _ 
Using (9.53) gives the following implicit algorithm after discretising (9.42): 


3 Кеп Ty-2—[1 +(9+0.5) Кы] Ty-1 +(2+4 Reew) Ty 
-|! + (3 -05 Ras | T,,-0, (9.54) 


which reduces to (9.45) if q — 0. The structure of (9.54) introduces the complication 
of a quadridiagonal matrix to be solved. This is achieved in an efficient manner by 
using the generalised Thomas algorithm (Sect.6.24) with f—0. The required 
codin xdi em to subroutines РАС Fig. 6.18) апа BANSOL (Fig. 6.19), is 

Equation (9.54) is combined with boundary conditions (9.43) to produce a 
computational solution to be compared with (9.44), Typical results on a coarse 
grid, Ax 20.2, К = 4, for various values of q are shown in Table 9.2. The case 4-0 
coincides with the three-point centred scheme (9.43), which produces the oscillatory 
solution plotted in Figs. 9.5 and 9.6. 

The case а= 0.5 is considerably more accurate although still slightly oscillatory. 
Increasing q is seen to produce a smoother solution but a more diffuse solution 
similar to the solution produced by the two-point upwind scheme (9.50) shown in 
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Table 9.2. Coarse grid solutions of (9.42, 43) using (9.54) 


x 0. 0.2 0.4 0.6 0.8 10 

Exact solution 0. 0.0000 0.0000 0.0003 0.0018 1.0000 
q=0 0. 0.0164 — 0.0328 0.1148 — 0.3279 1.0000 
4-0.5 0. 0.0000 — 0.0003 0,0058 — 0.0760 1.0000 
4-1.0 0. 0.0000 0.0002 0.0036 0.0597 1.0000 
4- 1.5 0, 0.0004 0.0032 0.0222 0.1491 1.0000 


Fig. 9.6. On a finer grid the case 4:0. produces the most accurate solutions (not 


shown). 
The equivalent ordinary differential equation actually solved by (9.54) can be 
obtained by expanding as a Taylor series about node j. The result is 





dT Ах АТ 1-2 4x 4T 
dx Ки dx? 1^6 ах? 
1 \ Ax3d*T Ах“ d^T 
LLL £85 1-109 5 ...=0. | 
«(n zc) 12 ae tl 04) 120 ах? 0 9.55) 


For practical problems with typically small values of a/u, R,., is O(1) or larger. 
This implies that the formal truncation error of representing the individual 
derivatives in (9.42) may have different influences on the solution accuracy of (9.54). 
Thus for 4 0.5, the third term іп (9.55) is likely to have the dominant influence on 
the solution error. This suggests for problems with small diffusivities « it is 
appropriate to represent the convective terms, like ud T/dx in (9.42), more ac- 
curately than the diffusive terms. 

The results shown in Tabie 9.2 are consistent with (9.55). An increasing q value 
is seen to introduce positive dissipation. However, it should be remembered 
(Sect. 9.2) that (9.55) is more relevant to the long wavelength behaviour. Equation 
(9.55) indicates that (9.54) with 4=0.5 and R,,,—1.0 is formally fourth order. 
However, for nonlinear convective problems К will depend on the local solution 
so that it will not be possible to achieve fourth-order accuracy throughout the 
computational domain. The particular choice 4--0.375 has been analysed by 
Leonard (1979) and is considered in relation to a two-dimensional problem in 
Sect. 17.1.5. 

It is apparent from the analysis of Sects. 9.2 and 9.3 that odd-ordered spatial 
derivatives in the truncation error are associated with wiggles in the solution 
whereas even-ordered spatial derivatives are associated with dissipation or un- 
stable growth depending on the sign of the multiplying coefficient. The tendency to 
cause wiggles without loss of amplitude will be referred to as a dispersion-like 
phenomenon even though the concept of dispersion is strictly only meaningful for 
unsteady flows. If a given steady flow is interpreted as the culmination of a 
transient flow (with steady boundary conditions) after a very large time, any 
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dispersion-like wiggles can be thought of as emanating from the balance between 
dispersion effects and the imposition of boundary conditions. 


9.4 One-Dimensional Transport Equation 


The one-dimensional transport equation can be written 
oF от эт. 

Or ax “ax?” 
where T is a passive scalar (e.g. temperature) which is being convected with a 
known velocity u(x, t) and being diffused (Fig. 1.7). И T is the temperature then a is 
the thermal diffusivity. To examine the behaviour of computational solutions of 
(9.56) we will assume that и and « are constants. Like the diffusion equation 
(Chap. 7) the transport equation is strictly parabolic (Sect. 2.3) and requires the 
same type of boundary and initial conditions, for example (7.24), as the diffusion 
equation. 

However, when и/х becomes large the first two terms in (9.56) would be 
expected to dominate. Then the transport equation will demonstrate behaviour 
similar to the convection equation (9.2). It may be recalled that exact solutions of 
the convection equation are typically wave motions that propagate with no 
damping (reduction in amplitude), Therefore, for large values of u/a in the 
transport equation (9.56), solutions may be expected to demonstrate wave-like 
motion that is slightly damped. 

From a consideration of the convection diffusion equation, which represents a 
steady-state limit of the transport equation, we may also expect spatial wave-like 
motions to appear in approximate solutions of (9.56) if symmetric three-point 
algebraic representations are used for the convective term with large values of u/c. 

In this section we examine some of the algebraic schemes (FTCS, DuFort- 
Frankel, etc.), that have been investigated previously for the diffusion (Chap. 7) and 
convection (Sect. 9.1) equations, to see if the simultaneous appearance of con- 
vection and diffusion introduces any new difficulties. 


(9.56) 


9.4.1 Explicit Schemes 
The FTCS scheme applied to (9.56) produces the algebraic equation 


Tj—T] wWTj,- TI.) «(T)-i- 27) eT). 


At 2Ах i Ax? -0, (9.57) 
which, as ап algorithm, can be written 
T5*!=(s+0.5C) Т"-,+(1- 25) Т7%-(5-0.50)7%;і., (9.58) 


where s=aAt/Ax* and C=udt/Ax. 
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A Taylor series expansion about the (/, n)-th node indicates that (9.58) is 
consistent with (9.56) with a truncation error of O(At, Ax?). If (9.58) is treated as 
being of O(At?, Ax?) it is consistent with 


oT OT OT At2T 

и gy 4 = .5 
пх “dx? 2 OP’ 9.5?) 
i.e. an additional term, previously interpreted as the leading term іп the truncation 
error, is included in the partial differential equation that (9.58) is consistent with. 
Following the modified equation approach (Sect. 9.2), time derivatives can be 
eliminated in favour of spatial derivatives to give 


oT | OT QT At? Ax NET 
дх? 


— ии и 
+и (a a) audAt+u 3 и 6 





4х? , AtAx?\a4T 
+ (оз adem au? arenas нарай с aso. (9.60) 


where 2 = и? 41/2. Thus, it can be seen that the use of a first-order time differencing 
scheme in the transport equation introduces first-order dissipation and dispersion. 
For large values of u/« the artificial diffusion term 222 T/Ox? can easily be 
comparable to the physical diffusion term unless the magnitude of At is limited. 
Unless д“Т/дх“ is large, the last term in (9.60) is unlikely to be significant. 
Therefore, an accuracy limitation on At is 


24 C\ 2 
41 <; ОГ C? «2s ОГ кы(==)<г " (9.61) 


Clearly, there is a strong incentive to use second-order time differencing when 
solving the transport equation. 

It may be recalled that forward time differencing combined with centred spatial 
differencing produces an algorithm that is conditionally stable for the diffusion 
equation and always unstable for the convection equation. For the transport 
equation the von Neumann stability analysis produces the value for the amplifi- 
cation factor С shown in Table 9.3. For stable solutions, |G| S1 implies the 
following restrictions (Noye 1983, p. 215): 


0<С2<28<1. (9.62) 


Equation (9.62) includes the ‘diffusive’ limit (бесі. 7.1.1). Strictly, (9.62) allows 
solutions for which Roe — uAx/a — C/s- 2.0, so that oscillatory solutions may be 
expected. Also, from (9.61), such solutions would be inaccurate. 

If a centred time difference expression is combined, as in the Richardson 
scheme (7.8), with a centred spatial difference expression to represent (9.56), the 
resulting algorithm is unconditionally unstable unless s=0 which degenerates to 
the leapfrog scheme (9.15) for the convection equation (9.2). 
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The DuFort-Frankel scheme, applied to the transport equation, is 


Ti-Ti} (T5.1— T5-1) 
2447 1" 24x 


_«[Тў-1-—(Т)1+Тў')+Т,1]_ 


L3 0. (9.63) 


The von Neumann analysis indicates that if the Courant number С 5 1 there is no 
restriction on s. À Taylor series expansion about the ( j, n)-th node indicates that, in 
contrast to the diffusion equation, it is necessary to ensure that At« Ax (C? «1 in 
Table 9.3) if (9.63) i$ to be consistent with (9.56). This is a very severe restriction. 

If an upwind difference expression for 27/0х replaces the centred difference 
representation in (9.57) the result, as an algorithm, can be written, for positive u, 


T'*!-(s-C)T! ,4(0—-2s- C) T25T7,, . (9.64) 


A Taylor series expansion indicates that (9.64) is consistent with (9.56) to 
O(At, Ax). The truncation error for the upwind scheme shown in Table 9.3 
indicates that an artificial diffusivity 


а 20. 5uAx(1— С) 


has been introduced. This same term appeared for the convection equation 
(Table 9.1). For the upwind scheme to generate accurate solutions of the transport 
equation it is required that 


2 
! К — > . 
a<a or сеп <7 с 
The application of the von Neumann stability analysis to (9.64) gives the value 
for the amplification factor G shown in Table 9.3. The necessary condition for 
stability, |G| $1, leads to the requirement 


25-С<1, (9.65) 
which is equivalent to 


0.5 Ax? 
5------- 9.66 
шү: , (9.66) 


cell 


which is considerably more restrictive than the limit for the diffusion equation 
(Table 7.1). 

The major restrictions for the schemes considered so far are associated with the 
need to obtain accurate solutions when the derivatives are discretised with first- 
order approximations. The Lax-Wendroff scheme (9.16) provides a means of 
achieving second-order accuracy in both time and space. For the transport 
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equation (9,56) the Lax-Wendroff scheme can be interpreted as ап FTCS scheme 
with a modified diffusivity, «* =a(1+0.5 СК). As R,,,,— оо the truncation errors 
and stability restrictions revert to those given in Table 9.1. 


9.4.2 Implicit Schemes 


For the diffusion equation implicit schemes are effective in removing the stability 
restriction s<0.5. Here, the most effective one-dimensional two-level scheme for 
the diffusion equation, the Crank-Nicolson scheme, will be applied to the transport 
equation. Strictly, the present scheme should be described as a trapezoidal scheme, 
since the original Crank-Nicolson scheme was developed for the diffusion 
equation. However, the labels Crank-Nicolson and trapezoidal will be used 
interchangeably to refer to any two-level scheme that evaluates the spatial 
derivatives symmetrically about the (n+1/2)-th time-level. 
For the transport equation the Crank-Nicolson scheme gives 


Tj''-Tj (T5.1— Т%- 1) Titi Т"+1 
Ac 109 д 24х 


I*' ,-2T'5- T1" T"*1-2T"*l. T" 1 
-osaa tT TEA ET o (9.67) 
which can be written as an algorithm 
—(s+0.5C) T5*1 c1 +s) T5*! —(s-0.5 C) T3} 
-(58--0.5С)7;-,--21-5)Т7%-(5-050)7%,і. (9.68) 


А Taylor series expansion of (9.68) about the (j, n)-th node indicates that it is 
consistent with (9.56) with a truncation error of O(At?, Ax?). Thus, the problem of 
artificial diffusion arising from first-order truncation errors is avoided. 

А von Neumann stability analysis of (9.68) produces the expression for the 
amplification factor G shown in Tabie 9.3, with no restriction on the magnitude of 
C or s for stability. But for solutions to be spatially non-oscillatory, the restriction 
Reen 2 is required. 

For the long wavelength components (small m4x) of the solution the dis- 
sipation and dispersion properties of computational algorithms for the transport 
equation can be inferred from the truncation errors listed in Table 9.3. Generally, 
the implicit schemes behave well. The physical dissipation in the system reduces the 
gross dispersion errors associated with short wavelengths (тАх->л) that occur 
with the convection equation. In particular, the finite element Crank-Nicolson 
scheme, shown in Table 9.3, has small dissipation and dispersion errors for smail 
values of C. To obtain information about the short wavelength components it is 
necessary to carry out a Fourier analysis (Sect. 9.2.1). This technique is applied to a 
convecting temperature front in the next section. 
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9.4.3 TRAN: Convection of a Temperature Front 


In this section some of the schemes described in Sects. 9.4.1 and 9.4.2 will be applied 
to the problem of a temperature front convecting with a velocity и through a fluid 
of thermal diffusivity «. For large values of the cell Reynolds (strictly Peclet) 
number, Ree = 44 x/a, the front width is restricted to a few grid spacings (Fig. 9.11) 
and all of the schemes described in Sects. 9.4.1 and 9.4.2 produce oscillatory 
solutions primarily associated with dispersion errors (Sect. 9.2). 

Consequently, two new schemes will be developed here that produce much 
better dispersion-related behaviour. Both schemes will be based on Crank- 
Nicolson time differencing. The first scheme is an extension of the Crank-Nicolson 
mass operator scheme (9.20) that was applied to the linear convection equation 
(Sect. 9.1.5), For the one-dimensional transport equation (9.56) this becomes 


T"* 1.. T" 
M (E Dese L,,)05(T" 4 T**!)z0, (9.69) 


where M, = (0, 1 — 26, 5} апа L, and L, are the conventional three-point centred 
difference formulae. The parameter ó will be chosen to reduce dispersion errors. It is 
clear that (9.69) will produce a tridiagonal system of equations that сап be solved 
efficiently. A von Neumann stability analysis indicates that (9.69) is stable as long as 
650.25. 

Using the modified equation approach (Sect. 9.2.2) indicates that (9.69) is 
solving the following equivalent equation, as far as the longwave behaviour is 
concerned, 


2 2 3 
aT aT PE sua! C шт 





dt дх ax? 6 12 дх? 
1 С: o^T 
— 1 —— — — = 
adx (5+ 4 г) Эха +...=0. (9.70) 


Formally, (9.69) is a second-order scheme. The choice ô= 1/6+ C?/12 is seen to 
suppress the lowest-order dispersion term in the truncation error. If, in addition, 
С2<0.5, the lowest-order dissipation term in the truncation error introduces 
positive dissipation. However, the optimal choice for ó requires C € 1.0 for stability. 
An alternative means of obtaining better dispersion behaviour is to use the 
four-point upwind finite difference Crank-Nicolson scheme 
T 1. T" 


7; 1+ (ит. 9 Г.) 0.5(Т"+Т"+1)=0, (9.71) 


where, for u>0 


(7,1-1;-1) 4907-3 -3Ti- %37,-7,,1) 


9.72 
Ах 34х 0.72) 
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Equation (9.72) is the four-point convective operator introduced in conjunction 
with the steady convection-diffusion equation, Sect. 9.3.2. 

Incorporation of (9.72) into (9.71) produces a quadridiagonal system of 
equations to be solved at each time-step, This requires using the generalised 
Thomas algorithm (Sect. 6.2.4). In program TRAN the generalised Thomas al- 
gorithm is implemented by making an additional forward sweep (lines 126-133) 
to render the system of equations tridiagonal. For the current application the 
generalised Thomas algorithm requires approximately 80% more operations than 
the conventional Thomas algorithm. The scheme represented by (9.71, 72) is stable 
as long as 42 — 3/R,,,,. This is not a practical restriction since generally only non- 
negative values of q are of interest. 

The equivalent modified equation (Sect. 9.2.2) to (9.71) is 


oT | oT eT 4» (E29 ауы 





a "Ox ax? 6 12 / 2х3 
u L (Са 4 75. 0. 7 
и —( Ge xe 0 (9.73) 


In (9.73) иАх?/ К = &Ax?. For flows for which Ren > 1 the dispersive term is seen 
to be the largest term in the truncation error. This term can be eliminated by the 
choice q—0.5 --0.25C?. For this choice of д and large values of Кен the lowest- 
order dissipative term introduces positive dissipation. 

The above schemes, (9.69, 71), and the schemes developed in Sects. 9.4.1 and 
9.4.2 are applied to the propagating temperature front problem, Fig. 9.7. At t=0 
a sharp front is located at x 20. For subsequent time the front convects to the 
right with a speed и and its profile loses its sharpness under the influence of the 
thermal diffusivity «. Consequently, for a given value of t, the larger the value of 
К..и= uAx/a, the sharper the profile of the front. 


Fig. 9.7. Initial conditions for propagating 


temperature front 
-20 оо 


The governing equation for this problem is (9.56). For sufficiently small values 
of time the following are suitable boundary conditions: 

Т(– 2, )=1.0, Т(2,0-00. (9.74) 
An exact solution, by the separation-of-variables technique, is 


_ 2. (x—ut) ]ехр[—а(2&—1)?°л?/12] 
Т(қ9-05-2 У, sin [ок-) 60 mero ed 


Есі 


(9.75) 
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The various schemes, which are implemented in program TRAN, are 
indicated in Table 9.4. A listing of TRAN is provided in Fig. 9.8 and the main 
parameters are described in Table 9.5, The exact solution (9.75) is evaluated in the 
subroutine EXSOL (Fig. 9.9). Typical output from program TRAN, produced by 
the Lax-Wendroff scheme, is shown in Fig. 9.10. 


Table 9.4. Various schemes implemented by program TRAN 





ME Description 


FTCS scheme (9,58) 

Lax-Wendroff scheme (9.58) with s* =s+0.5C? 

Explicit four-point upwind scheme (9.57, 72); EX-4PU 
Crank-Nicolson, finite difference; ô =0, а = 0; CN-FDM 
Crank-Nicolson, finite element; д = 1/6, а=0; CN-FEM 


b&b bh hw м 


Crank-Nicolson, mass operator; ô= 1/6 + C?/12, а = 0; CN-MO 
Crank-Nicolson, four-point upwind; ó —0, q—0.5--0.25C?; CN-4PU 


Table 9.5. Parameters used in program TRAN 


Parameter Description 
ME =1, FTCS scheme (9.58) 
ME = 2, Lax-Wendroff scheme (9.16) 
ME = 3, explicit four-point upwind scheme (9.57, 72) 
ME = 4, general Crank-Nicolson scheme (9.69, 71) 
JMAX number of points in the interval 0E x € 1 
NTIM number of time steps 
TIMAX time of comparison with exact solution 
JPR = ], propagating sine-wave; 
` =2, propagating temperature front 
NEX N in (9.75), used in EXSOL 
LN L in (9.75), used in EXSOL 
C Courant number (9.11) 
U u in (9.2, 56) 
ALPH thermal diffusivity a in (9.56) 
$ $—a4At/Ax? 
RCEL cell Reynolds number, udx/a, Sect. 9.3 
Q q in (9.53, 72) 
EM ó in (9.69) 
DT,DX,X At, Ах, x 
T dependent variable (9.2, 56) 
TEX exact solution, (9.24 or 75) 
AA, BB, CC coefficients multiplying 7%. 1, Т) and Т”, , for explicit schemes 
AE, BE, CE coefficients multiplying T^*1, Т"*! and 7341 for implicit schemes 
А elements іп the tridiagonal matrix (9.18) 
BANFAC factorises A into upper triangular 
BANSOL solves 4 T=R for Т^?! 
R right-hand side of tridiagonal scheme (9.22) 
TD solution for Т"*! on return from BANSOL 
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1c 
2c TRAN SOLVES THE LINEAR TRANSPORT EQUATION USING 
3c VARIOUS EXPLICIT AND IMPLICIT SCHEMES 

4c 

5 DIMENSION R(65),T(65),TD(65), TEX(65) , A(5,65) 
6 1,D(4),X(65) 

7 OPEN(1,FILE='TRAN.DAT’) 

8 OPEN(6,FILEs ' TRAN . ОПТ’) 

9 READ(1,1)ME, JMAX, ИТІМ, МЕХ, JPR,LW 

10 READ(1,2)C,U,S,Q, EM 

11 1 FORMAT(6I5) 

12 2 FORMAT(2F5.2,3E10.3) 

13 С 

14 IF(JPR .Ей. 1)WRITE(6,3) 

15 IF(JPR .EQ. 2)WRITE(6,4) 

16 3 FORMAT(* PROPAGATING SINE-WAVE') 

17 4 FORMAT(? PROPAGATING TEMP-FRONT') 

18 JMAF = JMAX - 2 

19 JMAP = JMAX - 1 

20 AMP = JMAP 

21 DX = 1.0/AMP 

22 IF(JPR .EQ. 2)DX = 4./AMP 

23 DT = C*DX/U 

24 EL = Ш 

25 ALPH = S*DXIsDX/DT 

26 IFC(ALPH .LT. 1.0E-10)ALPH = 1.0E-10 

27 RCEL = UsDX/ALPH 

28 QQ = Q*C/3. 

29 IF(ME .LT. 3000 = 9. 
30 € 
31 MQ = 0 
32 IF(ABS(QQ) .GT. 0.0001)MQ = 1 

33 ATIM = NTIM 

34 ТІМ = 0. 

35 TIMAX = DT*ATIM 

36 C 

37 IF(ME .Ей. 1)WRITE(6,5)ME 

38 IF(ME .EQ. 2)WRITE(6,6)ME 

38 IF(ME .EQ. 3)WRITE(6,7)ME 

40 IF(ME .EQ. 4)WRITE(6,8)ME 

41 5 FORMAT(’ ME =°,12,, ЕТС5 DIFFERENCING’) 
42 6 FORMAT(' ME -*,I2," LAX-WENDROFF ' ) 

43 7 FORMAT(' ME -?,12,? EXPLICIT АРТ UPWIND') 
44 8 FORMAT(^ ME -?,12,: GENERAL CRANK-NICOLSON?) 
45 WRITE(6, 9) JMAX,NTIM,C,U,DX,DT 
46 9 FORMAT(’ 2МАХе:,13,! NTIM-',I3,' С=’,РБ.2,’ Uz',F5.2, 
47 1’ DXs',F5.3,' DTs',F5.3) 

48 WRITE(6, 10)S, АГРЕ, RCEL, Q, QQ 
49 WRITE(6,11)NEX,LN,EM 

50 10 FORMAT(’ Sz',F5.2,"' ALPH=’,E10.3,’  RCELz',F6.3, 
51 17 Qz',F5.2,' z'.F6.3) ` . 

52 11 FORMAT(’ NEX-?,I5,' EL-',I5,' =’ E10.3) 
53 C 


Fig. 9.8. Listing of program TRAN 


For а cell Reynolds number, R,,— 1.0, solutions for the various explicit and 
implicit schemes are compared in Tabie 9.6. All methods produce smooth solutions 
with the Lax-Wendroff scheme being the most accurate explicit scheme. The most 
accurate implicit scheme is (9.71) with q— 0.5, which corresponds to the third-order 
treatment of идТ/дх. 

For higher cell Reynolds numbers, results are presented for various Crank- 
Nicolson implicit schemes in Table 9.7 (Roen = 3.33) and Table 9.8 (К, = 100). The 
Crank-Nicolson finite difference scheme serves as a reference. This solution is 
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54 ТРОНЕ .GT. 376070 12 

55 55 = 5 

56 IF(ME .Ей. 2)55 = $ + 0.5*C*C 

57 АА = (0.5%С + SS) + 3.*Qü 

58 ВВ = 1. - 2.955 ~ 3.%00 

59 CC = -0.5*C + SS + QQ 

60 GOTO 13 

61 12 AA = EM - 0.25*C - 0.5%5 - 1.5%00 
62 ВВ = 1.0 ~ 2.0%ЕМ + S + 1.5%й0 

63 CC = ЕМ + 0.25*C - 0.5%5 - 0.5%10 
64 АЕ = ЕМ +0.25*С + 1.5*QQ + 0.5*5 
65 ВЕ = 1. ~ 2.*EM - 1.5+00 - $ 

66 СЕ = EM - 0.25*C + 0.5%00 + 0.5%5 


67 13 УКТТЕ(6, 14) АА, BB, CC, AE, BE, CE 
68 14 FORMAT(' AAz',F8.5,' BB-',F8.5,' CCz',F8.5," AEs',F8.5,' BE=’, 


69 1F8,5,^ CEs',F8.5,/) 

70 C 

"c INITIALISE T AND EVALUATE TEX 

72 с 

13 CALL EXSOL(JPR,JMAX,X,T, TEX, REX, ОХ, И, ALPH, TIMAX,EL) 
74 С . 

75 DO 16 J = 1,JMAX 

76 DO 15 K = 1,5 


77 18 АКЛ = 0. 
18 16 CONTINUE 


79 WRITE(6 ,17) TIM 

80 17 FORMAT(^ INITIAL SOLUTION, ТІМ s',F5.3) 
81 WRITE(6,18)(X(J),J21,JMAX) 

82 WRITE(6,19)(1(J),J21,JMAX) 


83 18 FORMAT(' X=", 12F6,3) 
84 19 ҒОҺМАТ(: Т-”,12Ғ6.3) 


85 С 
86 С MARCH SOLUTION ІШ TIME 

87 C 

88 DO 26 И = 1,¥TIM 

89 IF(ME .GT. 3)60TO 21 

90 C 

91 с EXPLICIT SCHEMES 7 
92 C 

93 D(1) = ТА) 

94 D(2) = T(1) 

95 D(3) = T(2) 

96 DO 20 J = 2, МАР 

97 IF(ME .EQ. 3)D(4) = D(1) 

98 D(1) = D(2) 

99 D(2) = D(3) 

100 D(3) s T(J+1) 

101 ТО) = Aà*D(1) + BB*D(2) + CCeD(3) 

102 IF(ME .EQ. 3)T(J) = T(J) - Qü*D(4) 

103 20 CONTINUE 

104 GOTO 26 


Fig. 9.8. (cont.) Listing of program TRAN 


oscillatory showing an overshoot trailing the front which is particularly marked for 
К..и= 100 (Table 9.8 and Fig. 9.11). The two low dispersion schemes (9.69, 71) аге 
seen to be effective for Roen = 3.33 in suppressing unphysical oscillations. 

For К, = 100 the mass operator scheme with =} + C?/12 produces a sharp 
non-oscillatory temperature front. But the four-point upwind scheme produces a 
more spread out temperature front with a slightly oscillatory behaviour upwind 
and downwind of the front. In Fig. 9.11 grid-point values produced by the Crank- 
Nicolson mass operator (CN-MO) scheme are plotted only where they differ from 
the exact solution. 
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105 C 
106 C TRIDIAGONAL SYSTEM FOR IMPLICIT SCHEMES 
107 C 
108 21 IF(MQ .EQ. 1)DIM = 1. 
108 DO 22 J = 2,JMAP 
110 ІМ:1-1 
111 A(1,JM) = 0.5*00 
112 A(2,JM) = AA 
113 А(3,1М) = BB 
114 A(4,JM) = CC 
115 R(JM) = AE*T(JM) + BE*T(J) + CE*T(J+1) 
116 IF(MQ .EQ. 1 .AND. 1.07. 2)DIM = TCIM-1) 
117 IF(MQ .EQ. 1)R(JM) = R(JM) - 0Q.5*QQ*DIM 
118 22 CONTINUE 
119 R(1) = R(1) - A(2,1)*T(1) 
120 IF(MQ .EQ. 0)GOTO 24 
121 R(1) = RCA) - A(1,1)*T(1) 
122 R(2) = А(2) - А(1,2)*1(1) 
4233 C — 
' 124 С REDUCE A TO TRIDIAGONAL FORM 
125 € 
| 126 DO 23 JM = 3,JMAF 
‹ 127 JMM = ЈИ - 1 
| 128 DUM = A(1,JM)/A(2, MM) 
‚ 128 A(2,JM) = A(2,JM) - A(3,JMM)*DUM 
130 AC3,JM) = 4(3,JM) - A(4,JMM)*DUM 
42-131 AC(1,JM) = 0. 
i 132 КОЖ) = RCJM) - R(JMM)*DUM 
] 133 23 CONTINUE 
"134 — A(,1) = 0. 
135 А(1,2) = 0. 
136 24 А(2,1) = 0. 
137 А(4, МАР) = 0. 
138 С 
139 CALL BANFAC(CA, МАР, 1) 
140 CALL BANSOL(R,TD,A,JMAF, 1) 
141 С 
142 DO 25 J = 2, МАР 
143 25 T(J) = TD(J-1) 
144 26 CONTINUE 
145 С 
146 WRITE(6,27)TIMAX 
147 27 FORMAT(' FINAL SOLUTION, ТТИ =',FS5.3) 
148 WRITE(6,18) (X(J),Jz1, JMAX) 
149 WRITEC6 ,19) (T(J), J=1, JMAX) 
150 WRITE(6,28) (TEX (J) , 721, JMAX) 
151 28 FORMAT(' TEX-',12F6.3) 
152 SUM = 0, 
153 DO 29 J = 2,JMAP 
154 29 SUM = SUM + (T(J) - TEX(J))*«2 
155 RMS = ЗОВТ(ЗОМ/ САМР-1.)) - 
156 WRITE(6,30)RMS 
157 30 FORMAT(' RMS ERRs',E12.5) 
158 STOP 
159 END 


Fig. 9.8. (cont.) Listing of program TRAN 


Further information about the relative performance of the two schemes (9.69 
and 71) can be deduced by carrying ош a Fourier analysis as in Sect. 9.2.1. The 
initial conditions are represented by a Fourier series and explicit expression are 
obtained from the discretised equation for the amplitude ratio G,, and the phase 
angle фи for each Fourier mode д. 
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SUBROUTINE EXSOL(JPR,JMAX,X,T, TEX,NEX,DX,U, ALPH, TIMAX,EL) 


SETS THE INITIAL T SOLUTION AND FINAL EXACT (TEX) SOLUTION 


e 


DIMENSION T(65),TEX(65),X(65) 
JMAP = JMAX - 1 

PI - 3.141592654 
IF(JPR .EQ. 1)XST 
IF(JPR .EQ. 2)XST 
DO 1 J = 1,JMAX 

4(1:1-1 


ин 
o 
. 


- 2.0 


XST + AJeDX 
ТО) = 0. 

TEX(J) = 0. 

IF(JPR .EQ. 2)GOTO 3 


m 


с EXACT SOLUTION FOR PROPAGATING SINE-WAVE 


JM = 0.1001/DX + 1.0 

INC s UsTIMAX/DX * 0.001 
DO 2 J = 1,JMH 

T(J) = SIN(10.»PI*X(J)) 


JP = J + INC 
2 TEX(JP) = T(J) 
RETUREM 
с 
с EXACT SOLUTION FOR PROPAGATING TEMP-FRONT 
с 
3 Та) = 1.0 
ТЕХ(1) = 1.0 


005 J = 2, ЛАР 
IF(x(J) 17. O&.)TCI) = 1.0 
IFCABS(X(J)) „ГЛ. 1.0E-04)T(J) = 0.5 


BBSBSBRBRNBRSRBRYR®santanennrs 
MN = oar Q {л o Q IM к QO 1X0 Qo O ОП o U һу — O шю О -4 щл UN н 


34 DEM = 0. 
35 DO 4 К = 1,NEX 

36 AK = 2*K - 1 

37 DUM = AK*PI/EL 

38 SNE = SIN(DUM*(I(J) -USTIMAX)) - 
39 DIM s - ALPH*DUMSDUMSTIMAX 

40 IF(DIM .LT. -20.)G0TO 5 

41 4 DEM = DEM + (SNE/AK)*EXP(DIM) 

42 5 TEX(J) = 0.5 - 2,*DEM/PI 

43 RETURN 

44 END Fig. 9.9. Listing of subroutine EXSOL 


PROPAGATING TEMP-FRONT 

МЕ = 2 LAX-WENDROFF 

JMAX= 21 NTIM= 10 Ся .25 U= .50 Dx= .200 DT= .100 

Sx .25 ALPE» .100E+00 RCEL= 1.000 Qs .00 QQ= .000 

NEX= 100 Еш. 20 ЕМ» .000Е+00 

АА= .40625 ВВ= .43750 ССя .15625 AE= .00000 ВЕ» .00000 CE» .00000 


INITIAL SOLUTION, ТІН = .000 
Х=-2.000-1.800-1.600-1.400-1.200-1.000 -.800 -.600 -.400 -.200 .000 .200 
X» „400 .600 .800 1.000 1.200 1.400 1.600 1.800 2.000 
Т= 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 .500 .000 
T= .000 .000 .000 .000 .000 .000 .000 .000 .000 
FINAL SOLUTION, ТІМ 1.000 
X2-2.000-1.800-1.600-1,.400-1.200-1.000 -.800 -.600 -.400 -.200 .000 .200 
X» .400 .600 .800 1.000 1.200 1.400 1.600 1.800 2.000 
T= 1.000 1.000 1.000 1.000 1.000 .999 .997 .991 .973 .934 .862 .747 
Тя .593 .422 .261 .137 .059 .020 .005 .001 .000 
ТЕХ= 1.000 1.000 1.000 1.000 1.000 1.000 .998 .993 .978 .941 .868 .749 
ТЕХ= .588 .412 .251 .132 .059 .022 .007 .002 .000 


RMS ERR» .44842Б-02 Кір. 9.10. Typical output from program TRAN 
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For the Crank-Nicolson mass operator scheme (9.69) the following expressions 
are obtained: 


- — 2 1 2\4 

G= [1 — (26 4- (1 — cos 9») + (0.5 Csin дһ) 4 (9.76) 

П-(29-5(1-со50,)17-К(0.5 Csin 8m) 

— Csin 0, [1 —28(1 — cos 0„)] 

— LEM Um, я 77 

tan фе —cos6,]? [0.5 Csin 0, T? — [s(1 — cos 6, 9.77) 
The corresponding expressions for the exact solution are 

Gex, m=exp[—CAx?/R,,,,] and tan@,, m=tan(— Chn) . (9.78) 


For conditions corresponding to Tables 9.7 and 9.8 the ratio G,,/G,, „апа ф, are 
presented in Tables 9.9 and 9.10 for various values of б„. 


Table 9.9. Wavelength behaviour for А, =3.33, C= 1.0, 4x=0.10 


Scheme 8,/л- 0.05 0.25 0.50 0.75 1.00 
CN-FDM 6,/6.,, 0.996 0.858 0.619 0.389 0.251 
CN-MO С6,/6,, 0.996 0841 0.573 0.683 1.003 
CN-4PU с,6,, 0.996 0.827 0.438 0.216 0.231 
exact X —9.00 — 45.00 —90.00 -13500 -18000 
CN-FDM д, —8.94 — 39.19 — 56.58 — 49.09 0.00 
CN-MO 4, —9.00 —45.37 — 100.21 -16420 -18000 


CN-4PU ф, —8.96 — 40.65 —71.29 — 130.14 — 180.00 
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Table 9.10. Wavelength behaviour for К, = 100, C= 1.0, Ах--0.10 





Scheme  б„/л= 0.05 0.25 0.50 075 100 
CN-FDM | G,/G,. 1.000 0.995 0.984 0970 0961 
CN-MO GG. 1.000 0.994 0.980 0967 1000 
CN-4PU СС" 1.000 0.958 0.674 0277 0010 
exact "m -900 —4500 -—9000 -—13500  —18000 
CN-FDM d, —8.94 —3894 5313 —3895 000 
СМ-МО/ф, —9.00 —4500 9001 -13508 -18000 
CN-4PU d. —8.96 4027 —6130 —7314  —180.00 


For the Crank-Nicolson four-point upwind scheme (9.71) the amplitude ratio 
G,, and phase angle ф„ аге | 


(псев зайва) әлә 

"o MI +5*(1—cos 6,,)]? + {0.5 Csin 6„[1+4(1— соз 6,,)/6]}? em 
and 

— Csin 6,,[1 + q(1— cos 8 m)/6] (9.80) 


tan Om тж cos б„)]2— (0.5Csin б„[1+ 4(1—созб„)/6]} " 


where s* = s+ qC(1— cos6,,)/3. The amplitude ratio and phase angle variation with 
0, for Ree = 3.33 and 100 are shown in Tables 9.9 and 9.10, respectively. - 

For Кн = 3.33 (Table 9.9) the finite difference Crank-Nicolson scheme is seen 
to show poor agreement with the exact phase angle particularly for short wave- 
lengths, 6,,>. By contrast, the mass operator and four-point schemes show good 
phase agreement. Both schemes tend to be rather dissipative at intermediate 
wavelengths (0, = 1/2) and the four-point upwind scheme is dissipative for shorter 
wavelengths as well. 

For Кё = 100 (Table 9.10) the Crank-Nicolson finite difference scheme main- 
tains the amplitude for all wavelengths but introduces very significant phase errors 
for short wavelengths. This is consistent with the oscillatory temperature front 
solution shown in Fig. 9.11. The four-point upwind scheme shows some phase error 
for intermediate wavelengths but compensates for this by having significant 
attenuation of short wavelengths. The corresponding temperature front solution 
(Fig. 9.11) shows some oscillation and some smoothing out of the sharp tempera- 
ture profile. The Crank-Nicolson mass operator scheme maintains the amplitude 
as well as the Crank-Nicolson finite difference scheme, but also achieves excellent 
phase behaviour. Consequently, the sharp profile of the temperature front 
(Fig. 9.11) is not unexpected. 

Although the variable 6 mass-operator scheme and the variable 4 four-point 
upwind scheme are both effective in reducing dispersion errors, their influence 
arises from the discretisation of different terms in the governing equation (9.56). 
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The mass operator is associated with the time derivative 27 /dt, whereas the four- 
point upwind differencing is associated with the convective term идТ /дх. It follows 
that the optimal mass-operator strategy is not available for the one-dimensional 
steady convection-diffusion equation (Sect. 9.3) However, for multidimensional 
transport equations, mass operators appear in the finite element method associated 
with undifferentiated spatial directions. Consequently, the variable 6 mass-oper- 
ator mechanism for reducing dispersion-like behaviour is available for both steady 
and unsteady multidimensional transport equations (Sect. 9.5). 

A similar Fourier analysis to the present is provided by Pinder and Gray (1977, 
pp. 150-162) for conventional finite difference and finite element schemes applied 
to the one-dimensional transport equation. A Fourier analysis of the Taylor- 
Galerkin formulation (Sect. 9.2.3) applied to the one-dimensional transport equa- 
tion is provided by Donea et al. (1984). 

In this section the various analytical techniques, von Neumann stability, 
modified equation approach and Fourier analysis, have been used to analyse some 
conventional schemes for convection-dominated problems and to design some 
better schemes. 


9.5 Two-Dimensional Transport Equation 


The additional complications of considering the multidimensional transport equa- 
tion are comparable to those in progressing from the one-dimensional to multi- 
dimensional diffusion equation. In two dimensions, the transport equation can be 
written 

әт әт OT дт rT 

= tu 0—0 25 =0 . 9.81 

ét ^ Ox ду Cox % ду? 9.81) 
The more severe stability restriction that occurred for explicit two-dimensional 
algorithms for the diffusion equation is also apparent for explicit representations of 
9.81. For example, the two-dimensional version of the FTCS scheme (9.58) pro- 
duces the algorithm 


Ту! =(s,+0.5C,) T5. 1, ‚+@- 25,— 25,) T5  Қ(%,-0.5С,)Т7% 1, 
+(s,+0.5C,) T?,-1+(s,-0.5C,) 7% 44, , where (9.82) 
5,-а,4/4х2, C,=udt/4x , s,=a,dt/dy?, C,=vdt/dAy. 


A von Neumann stability analysis of (9.82) indicates that for a stable solution, the 
following restrictions apply (Hindmarsh et al. 1984) 


С? С? 
(s,+s,)S05, —.—x2. (9.83) 
5 5 


If s,=s,=s then $50.25 for stability which is twice as severe as іп one dimension. 
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The other major problem in multidimensions is the introduction of artificial 
diffusion, and particularly artificial cross-wind diffusion associated with first-order 
differencing schemes. As we have already seen the use of a first-order time 
differencing or upwind differencing for the convective terms introduces artificial 
diffusion (and dispersion) terms. In two dimensions the equivalent first-order 
differencing scheme introduces artificial diffusivities 


a= —05u4x(C,—1) and c2 —0.5v4y(C,— 1) . (9.84) 


The errors introduced by these terms are particularly severe when the velocity 
vector is at 45? to either coordinate direction (de Vahl Davis and Mallinson 1976). 
This is discussed further in Sect. 9.5.3. 


9.5.1 Split Formulations 


Implicit algorithms for the two-dimensional transport equation avoid most of the 
stability difficulties of the explicit schemes but have the problem of achieving an 
efficient solution algorithm; just as for the diffusion equation. 

Here, an examination is made of a typical split formulation for the two- 
dimensional transport equation. Application of the Galerkin finite element method 
(Sect. 5.3) with bilinear rectangular elements, to the transport equation (9.81) 
produces the result 


M, & M, B 


А -Г-иМ,%91,-гМ, © L,+a,M, ® Lyx 
Lk 


+ aM, ® Ly] Tpx ! 0.85) y 


where L,2(—1,0, 1)/24х, Г,=(1,0, —1)7/24y and M,=(4, $ 1) for a uniform 
grid. The operators L,,, L,, and М, are given by (8.30 and 31). 

Equation (9.85) may be compared with (8.29). The additional terms in (9.85) 
come from the convective terms. However, the directional mass operators in the 
convective terms behave in the same manner as for the diffusion terms. That is, for 
the finite element method, they spread the influence of the differential operators, L, 
and L,, in the normal direction. This follows from the tensor product © and the 
form of the operators M, and M,. Equation (9. 85) с сап Бе generalised by defining 


ее. 


M,=(6,, 1—26,, ôs), М,=(6,, 1-25, ô) , (9.86) 
where д, and д, are parameters that may be chosen to improve the scheme just as 
6 in (9.21) was chosen in Sect.9.4.3 to minimise dispersion errors. The choice, 
6,=6,=0, produces a a three-point finite difference scheme; the choice, 6, =6,=%, 
gives the linear finite element scheme. 

With a three-level representation for ОТ /dt, (9.85) сап be written as 


revi aT 


т | (1— B) RHS"+ BRHS"*! (9.87) 


М.Ф маву 
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where 
AT*t1=T°t!_T" and AT"—T"—T" ! and 


RHS= — [М, ® (uL,—a,L,,)- M, @ (vL,—a,L,,)] Ty . 


This same scheme was applied to a difference representation, (8.26), of the diffusion 
equation. 

By introducing the same splitting as was applied to (8.29) the following two- 
stage algorithm for the two-dimensional transport equation is obtained: 


B At 
LE - АТ%- RHS" 
(мә (75 ana. „| Тў, = (55, 5 


(+; туу) 9 М,4Т),, (9.88) 
апа 


м, GL d -) аңы, а) | arit =атуь (9.89) 


Clearly the i inclusion of the convective terms, i.e. uL, and vL,, does not prevent the 


"efficient splitting of (9 9.87) inta a collection of “tridiagonal ia] subsystems of equations 
associated with each gridline. 

The algorithm given by (9.88, 89) is consistent with (9.81) and has a truncation 
error of O(At?, Ax?, Ay?) for the two-level scheme, y —0, В =0.5 and for the three- 
level scheme, у= 0.5, 821.0. The same order of accuracy is achieved whether а 
linear finite element or a three-point finite difference scheme is used to evaluate the 
operators M,, М,, L,, Lxx, L, and L,,. However, the coefficients in the truncation 
error are different i in the different cases and the finite element formulation gives 
more accurate solutions. 

A von Neumann stability analysis of (9.88 and 89) leads to a quadratic equation 
(for the three-level scheme) for the amplification factor б. Evaluation of this 
equation to assess the stability is most easily carried out numerically. The results 
indicate unconditional stability with 820.5 for both the finite element and finite 
difference formulations. 


9.5.2 THERM: Thermal Entry Problem 


In this section the split formulations developed іп Sect. 9.5.1 will be applied to the 
problem of the flow of a ‘cold’ fluid entering a ‘hot’ two-dimensional duct with a 
very high aspect-ratio cross-section. Away from the side-walls of the duct the 
temperature distribution is effectively two dimensional, and governed by the steady 
form of the following energy equation: 


aT дит) 207) . STO eT. 
dt дх ду "gx? "Oy ^" 





(9.90) 
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where х, = а; = а, thé thermal diffusivity. Equation (9.90) is expressed in unsteady 
form to facilitate a pseudo-transient solution strategy (Sect. 6.4). If the velocity 
components и and v are known (9.90) is a linear two-dimensional transport 
equation. 

It is convenient to solve (9.90) in nondimensional form, after introduction of the 
nondimensional variables 














T — T, u vRe 2.5u,t 
T'- , "ш--, ‘= , = " 9.91 
Т„— Т» н ил 7 2.5 ит, аКе 921 
„95%, уз’, дє Шз and Pro, 
аКе а у uCp 


where T, and Т, are the entry and wall temperatures, и, is the mean axial velocity 
and Re and Pr are the Reynolds and Prandtl numbers, respectively (Sect. 11.2.5). 
The particular choice for x' and the other nondimensional variables ensures that 
the temperature profile develops over approximately the same range of x' for 
different choices of Re. If the above nondimensionalisation is substituted into (9.90) 
the same equation results in terms of primed variables with 


10 1.6 
— =. 9.9 
Ax Pr Re? and 4a, Pr (9.92) 


In the rest of this section primes will be dropped from nondimensional variables. 
For the computational domain shown in Fig. 9.12, appropriate boundary 
conditions are - 


Т(0,у-0, onx=0 and 2.0 On X= Xmax (9.93) 


T(x, +1)=1, опу= +1. 


Application of the group finite element formulation (Sect. 10.3 and Fletcher 
1983) with linear interpolation on rectangular elements produces a system of 
ordinary differential equations 


A T =1.0 , чём =0. В 





y -1 т 
ox ^C 
и =1.5(1-у?) 
v а О, 
Т =0. 
Уз1.5 (1-у 


Fig. 9.12. Computational 
С domain for thermal entry 
X =Хтах problem 
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oT 
М, 69 М, Е | ш {М, 59 [— І.(«Т)-ға, Ly, T] 
ds 


+M, 8 [- LoT) aL, T]) pe ; (9.94) 


which differs from (9.85) only by the inclusion of u and v in the convective operators 
L, and L,. Consequently, the two-stage algorithm (9.88, 89) is implemented in 
program THERM (Fig. 9.13) with L, and L, operating on uT and 27, respectively. 

Program THERM is written to operate on any prescribed velocity distri- 
bution. However, to facilitate the comparison of the temperature solution with a 
semi-analytic solution (Brown 1960) a fully developed velocity distribution is 
assumed 


и-1.(1-у), 2-0. (9.95) 


To avoid the discontinuity in the solution at A (Fig. 9.12) the following boundary 
condition on x = 0 is implemented in THERM: 


T(0, уу= y? . (9.96) 


This produces an inlet temperature profile that is zero except close to the corners 
х=0, у= +1. 

The evaluation of RHS" іп (9.88) is carried out in subroutine ВЕТНЕ 
(Fig. 9.14). The major parameters used in program THERM and subroutine 
RETHE are described in Table 9.11. Typical output from THERM is shown in 
Fig. 9.15 for the approximate factorisation finite difference method. 

The centre-line solution is compared with the semi-analytic solution (Brown 
1960) which is evaluated in subroutine TEXCL (Fig. 16.5; Vol. 2, p. 268). The semi- 
analytic solution is based on a reduced form of (9.90) which neglects the axial 
diffusion term „д? T/dx*. The justification for this is discussed in Sect. 16.1.4. It may 
be noted that the resulting semi-analytic solution is a good approximation to the 
solution of the present problem except close to the entrance (x=0) and for smail 
values of Re. Because of the limitations on Brown's solution it will be referred to as 
semi-exact in program THERM and Table 9.12. 

The centre-line solutions for a relatively coarse grid are shown in Table 9.12 
and are plotted in Fig. 9.16. The rms errors shown in Table 9.12 are based on the 
centre-line solution and exclude the point j2 2 because of the possible doubtful 
relevance of the semi-exact solution close to the entrance х=0. 

Also shown in Table 9.12 and Fig. 9.16 are results obtained with the four-point 
upwind discretisation of Q(u T)/Ox replacing L, in (9.94) and the equivalent of (9.88) 
and (9.89). This is the same four-point upwind discretisation that was examined in 
Sects. 9.3.2 and 9.4.3. A consistent treatment of a two-dimensional problem implies 
that a four-point upwind discretisation should also be introduced for o(vT)/Oy. 
This is not done in program THERM because v is zero in the current example 
and consequently, the term д(07)/ду has no influence on the solution. The four- 
point upwind scheme is not applied at j = 2 because of the difficulty of ascribing an 
appropriate value to T, ,. 
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THERM APPLIES APPROXIMATION FACTORISATION TO SOLVE 
THE UNSTEADY THERMAL ENTRY PROBLEM FOR T(X,Y) 


DIMENSION T(41,42),DT(41,42),R(41,42),U(41,42) ,V(41,42), 
1EMX(3),ENY(3),B(5,65) ,RRT(65) , DDT(65) , CI (3) , CY (3) , TEX (42) 


2, ALFC10), DYFL(10) , CXQ(42 


COMMON CX,CXQ,CY,CCX, CCY, EMX, ENY, WX, NY, R, T, DT,U,V 
DATA  ALF/1.6815953,5.6698573,9.6682425,13.6676614,17.6673736, 
121.6672053,25.6670965,29.6670210,33.6669661,37.6664327/ 
DATA DYFL/-0.9904370,1.1791073,-1,2862487,1.3620196,-1,4213257, 
11.4704012,-1.5124603,1.5493860,-1.5823802,1.6122503/ 


OPEN (1,FILE-'TRERM,DAT') 

OPEN (6, FILEs' THERM. QUT ') 
READ(C1,12NX, NY , ME, TTMAX,GAM,BET,Q 
READ(1,2)DTIM, EPS ,RE,PR,XMAX 
FORMAT(4IS,3F5.2) 

2 FORMAT(3E10.3,2F5.2) 


[Еч 


WX + 
NXP = NX - 
NYP = ШҮ - 
WYPP = ЖҮР 
NYH = NY/2 
ANX - NIP 
DX = XMAI/ANX 

ANY = WYP 

Оу = 2./АЯҮ 

ALI = 10./RE/RE/PR 
ALY = 1.6/PR 

CX(1) = -0.5/DX 
cx(2) =9 
сх(3) 
cy(i) 
сү(2) 
СҮСЗ) 
Сха(1) 
cxaC2) 


EXS 


" m wu 
+ liekna 
а һа 


0.5/рХ 
-0.5/DY 
0. 

0.5/DY 

Q/DX/3. 
-3.*CXQC1) 
GXxXQ(3) -CXQ(2) 

CXq(4) = -CXQ(1) 
CCX = ALX/DX/DX 

CCY = ALY/DY/DY 

IF(ME .НЕ. 220 = 0. 
EMX(1) = 0. 

IF(ME .EQ. 3)EMX(1) s 1./6. 
ENX(2) = 1. - 2. *EMX(1) 


uon wa 


паа и 


ЕМХ(3) = EMX(1) 
003 J = 1,3 
3 ЕМҮ(І) = EMX(J) 


IF(ME .EQ. 1)WRITE(6,4)NX,NY, ME, ITMAX 
IF(ME .EQ. 2WRITE(6,5)NX,NY ME, ITMAX 
IF(ME .EQ. 3)WRITE(6,6)NX,NY, ME, ITMAX 


4 FORMAT(? THERMAL ENTRY PROBLEM: 3PT 
112,’ ITMAX =',13) 


5 FORMAT(' THERMAL ENTRY PROBLEM: 4PT UPWIND РОМ, 


1’ ME 5',I2,' ІТМАХ=',13) 


РОМ, ИХ,ИҮ =’, 213,’ 


6 FORMAT(' THERMAL ENTRY PROBLEM: LIN FOM, ИХ, МУ =',213,’ 


112,’ ІТМАХ =',13) 
WRITE(6,7)GAM,BET,DTIM,AMAX,PR,RE,Q 


7 FORMAT(? САМ -!,Е5.2,! BETA =’,F&.2,' DTIM =’, Fé.3,/ 


1F5.2,' PR =’,F5.2,’ RE z',F5.1,' 


GENERATE INITIAL SOLUTION 


Fig. 9.13. Listing of program THERM 


Q =',Р5.2,/) 


ME =’, 


ИХ, ШУ = ,273, 


ME =’, 


XMAX =", 
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67 DD 9 K = 1, му 

68 АК =К- 1 

в9 Y = -1, + AK*DY 

70 T(K,1) = У+**32 

71 T(K,NXS) = 1. 

72 U(K,1) = 1.5*(1. - Ye¥) 

73 V(K,1) = 0. 

74 DO 8 J = 2,Nxs 

75 U(K,J) = UCK,1) 

76 Y(K,J) = 0. 

77 IF(J .EQ. ШХ5)бОТО 8 

78 АЈ = 1-1 

79 TCK,J) = Т(К,1) + AJ*«DX*«(T(K,NXS) -TCK, 12) /XMAX 
80 8 CONTINUE 

81 9 CONTINUE 

82 ро 12 J = 1,NxS 

83 DO 10 K = 1,NY 

34 DT(K,J) = 0. 

85 10 R(K,J) = 0. 

86 DO 11 K = 1,5 

87 11 B(K,J) = 0. 

88 12 CONTINUE 

89 GAMH = GAM 

90 GAM = 0. 

91 BETH = ВЕТ 

92 BET = 0.5 

93 ITER = 0 

94 C 

95 13 DBT = BET*DTIM/(1.+GAM) 

96 СХА = 0,5*DBT/DX 

97 CYA = 0.5*DBT/DY 

98 ССХА = DBT*CCI 

99 CCYA = DBTeCCY 

100 С 

101 CALL RETHECRMST, GAM, DTIM) 

102 С 

103 IF(RMST .LT. EPS)GOTO 24 

104 IF(RMST .GT. 1,0E+04)GOTO 24 

105 С 

106 С TRIDIÁGONAL SYSTEMS IN THE X-DIRECTION 
107 c 

108 Do 19 К = 2,NYP 

109 DO 15 J = 2,NX 

110 ІМ-1-1 

111 В(2,1М) = EMX(1) - COXA - CXA«U(K, JM) 
112 B(3,JM) = EMX(2) + 2.*ССХА 

113 B(4,JM) = EMX(3) - ССХА + CXaeUCK, 1+1) 
114 RRT(JM) = R(K,J) 

115 IF(ME .NE. 2)GOTO 15 

116 LST = 1 . 
117 IF(J .EQ. 2)15Т = 2 ` 
118 B(1,JM) = 0. 

119 DO 14 L=LST,4 

120 LJ=]+L-3 
121 14 B(L,JM) = B(L,JM) + CXQ(L)*«U(K,LJ)*DBT 
122 15 CONTINUE 

123 B(2,1) = 9. 
124 B(2,JM) = B(2,JM) + B(4,JM) 
125 B(4,JM) = 0. 

126 IF(ME „ШЕ. 2)бОТО 17 
127 С 
128 С REDUCE TO TRIDIAGONAL FORM 
129 С 
130 DO 16 ІМ = 3,ИХР 
131 IMM = JM - 1 
132 DUM - B(1,JM)/B(2,JMM) 


Fig. 9.13. (cont.) Listing of program THERM 
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B(2,JM) = B(2,JM) - B(3,JMM)*DUM 

B(3,JM) = B(3,JM) ~ B(4,JMM) *DUM 

B(1,JM) = 0. 

RRT(JM) = RRT(JM) - RRT(JMM)*DUM 
16 CONTINUE 

B(1,2) = 0. 


17 CALL BANFAC(B,WXP,1) 
CALL BANSOL(RRT,DDT,B,WXP, 1) 


DO 18 J = 2,NX 


ІМ-1-і1 
18 RCK,J) = DDT(JM) 
19 CONTINUE 


TRIDIAGONAL SYSTEMS IN THE Y-DIRECTION 


ро 22 J = 2,WNX 
DO 20 К = 2,ЖҮР 


KK =K-1 
B(2,KM) = ЕМҮСІ) ~ CCYA - CYTA*V(KM,J) 
B(3,KM) = EMY(2) + 2. CCYA 
B(4,KM) = ENY(3) - ССҮА + CYA*V(K+1,J) 

20 RRT(KM) = R(K,J) 

B(2,1) 50. 
B(4,KM) = 0. 


CALL BANFAC(B,NYPP,1) 
CALL BANSOL(RRT,DDT,B,WYPP, 1) 


INCREMENT T 


DO 21 К = 2,NYP 
KK-*K-1 
DT(K,J) = DDT(KM) 
21 TCK,J) = T(K,J) + DDTOOD 
22 CONTINUE 
DO 23 К = 2,NYP 
T(K,NXS) = T(K,NIP) 
23 DT(K,NXS) = DT(K,NXP) 
ITER = ITER + 1 
САМ = GAME 
ВЕТ = BETH 
ІЕ(ІТЕА .LE. ITMAX)GOTO 13 


COMPARE WITH SEMI-EXACT SOLUTION 


24 SUM = 0. 
ро 25 J = 2,NX 
АЈ =J-1 
X = AJ*DX 


CALL TEXCL(X,TDI,PR, ALF, DYFL) 


TEX(J) = ТІХ 
IF(J .EQ. 2)G0TD 25 
DIF = T(NYE,J) - TDX 
SUM = SUM + DIF*DIF 
25 CONTINUE 
RMS = SQRT(SUM/(ANX~1.0)) 
МАТТЕ( 6,26) ITER, АМЅТ, RMS 
26 FORMAT(’ AFTER ',I3,' ITERATIONS, RMS-RHS z',E10.3, 
1% RMS-ERR -',E10.3) 
WVRITE(6,27) (T(ITBR, J) 222, X) 
WRITE(6,28) (TEX(J), Jz2, NX) 
27 FORMAT(’ СЛ. TEMP z',11F6.3) 
28 FORMAT(^ EX TEMP -”,11Ғ6.3) 
STOP 
END 
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Fig. 9.13. (cont.) Listing of program THERM 
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1 

2 SUBROUTINE RETHE(RMST, САМ, ТІМ) 

3c 

4c EVALUATES RIGHT-HAND SIDE OF THE 

5c TWO-DIMENSIONAL TRANSPORT EQUATION 

6c 

7 DOUBLE PRECISION RTD,SUMT,AN,DSQRT 

8 DIMENSION DT(41,42),T(41,42),R (41,42) ,U(41,42), V(41,42), 
9 1DMX(3,3),DMY(3,3) , ENX(3) ‚ЕМУ (3), CX (32, CY (32, CIQ(4) 
10 COMMON CXI,CXQ,CY,CCX,CCY, ENX, ЕМУ, ИХ, ШУ, R, T, DT,U,V 
11 ИТР = WX - 1 

12 TYP = NY - 1 

13 DO 1 I= 1,3 

14 DMX(I,1) = CCX*EMY(I) 

15 DMX(I,2) = -2.*DMX(I,1) 

16 DMX(I,3) = DMXCI, 1) 

17 DMY(1,I) = CCY*EMX(CI) 

18 DMY(2,I) = -2.*DMY(1,I) 

19 1 DMY(3,1) = DEY(1,1) 
20 С 
21 SUMT = 0. 
22 ро 6 J =2, их 
23 005 К = 2,NYP 
24 RTD = 0. 
25 004 И = 1,3 
26 NK-K-291 
27 DO 2 M= 1,3 
28 MJ2J-2*4M 
29 DUM = DMX(N,M) + DMY(N,M) - CXCM) *EMY (N) *UCWK,MJ) 
30 1 - CY(ODSEMXCOSVOIK MJ) 
31 2 RTD = RTD + DUM*T(OIK,MJ? *DTIM 
32 1 + GAM*ENX(M)*EMY (И) «ОТ (ИК, MJ) 

33 MST = 1 
34 IF(J .Ей. 2)MST = 2 
35 DO 3 M = MST,4 
36 MJ =J-3+Y% 
37 3 RID = RTD - CXQ(M)*EMY (NW) sUCEK,MJ)*TCEK MJ) *DTIM 
38 IF(J .EQ. 2)RTD = RTD - CXQ(1)*EMY (N) «ОСИК, 1) *TCIK, 1) «DTIM 
39 4 CONTINUE 
40 SUNT = SUMT + RTD*RTD 
41 5 R(K,J) = RTD/(1. + САМ) 
42 6 CONTINUE 
43 AN = NXP*(OIYP-1) 
44 RMST = DSQRT(SUMT/AN) /DTIM 
45 RETURE 
46 END Fig. 9.14. Listing of subroutine RETHE 


It is clear that all methods are giving good agreement with the semi-exact 


solution, in view of the coarseness of the grid. The particularly close agreement of 
the linear finite element method is consistent with the steady-state modified 
equation which is actually being solved. 


The relative behaviour of the mass operator construction and the four-point 


upwind discretisation for the convective terms can be compared by considering the 
steady-state modified equation for a scheme incorporating both features. This is 


2T. 
х дҳ? 


4 027 
э, 
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Table 9.11. Parameters used in program THERM 





Variable Description 
ME =1, approximate factorisation, three-point finite difference method, AF-FDM 


=2, approximate factorisation, four-point upwind scheme, 4=0.5, AF-4PU 
=3, approximate factorisation, linear finite element method, 6=1/6, AF-FEM 


ITMAX maximum number of iterations 

GAM, BET у, В in (9.88, 89) 

4 parameter controlling four-point upwind scheme, 4, in (9.97) 

NX, NY number of points in the x and y directions 

DTIM At 

EPS tolerance on the magnitude of RHS”, (9.88) 

RE, PR Reynolds number, Prandtl number 

XMAX downstream extent of the computational domain 

ALX, ALY &,, у, in (9.90, 92) 

СХ, СҮ L,, L, in (9.88, 89) 

CXQ increment to L, to generate [9 

EMX, EMY M,, M, in (9.88, 89) 

T; U, V temperature; x and y velocity components 

TEX semi-exact centre-line temperature 

TEXCL calculates TEX for given x and Pr; requires ALF апа DYFL; described in 
Sect. 16.1.4 

DT АТ} in (9.88) 

DDT AT}, and AT; 1! on return from BANSOL 

B quadridiagonal and tridiagonal matrix; left-hand sides of (9.88, 89) 

R RHS" in (9.88), evaluated in RETHE; also used to store АТУ, in (9.89) 

RMS;RMST Тон Тс. 1: ПЕН", 


THERMAL ENTRY PROBLEM: ЭРТ FDM, NX,NY = 11 11 МЕ = 1 ITMAX = 25 
GAM = .50 BETA = 1.00 DTIM = .200 XMAX = 2.00 PR= .70 RE=100.0 0 = .00 


AFTER 20 ITERATIONS, RMS-RHS = .857Е-05 RHS-ERR = .180E-01 
C/L TEMP = .459 .744 .881 .945 .975 .988 .995 .998 „999 1.000 
EX TEMP = .493 .786 .909 .962 .984 .993 .997 .999 .999 1.000 


Fig. 9.15. Typical output from program THERM 


where 4, and q, are the free parameters introduced into the four-point upwind 
discretisation of д(иТ)/дх and д(гТ)/ду, respectively. The parameters д, and д, 
replace 6 in the definitions of M, and M,, (9.94). The cell Reynolds numbers are 
defined as 


A A 
RT and К.у . (9.98) 


х y 


In the present example, v —0 so that К. , —0. In contrast to the situation for the 
convection-diffusion equation, Reen, х is a function of position: 

It is clear from (9.97) that either of the choices 4, = q, —0.5, 6, =ô, —0, or q,— q, 
=0, 0,— à, =$ will eliminate the “dispersive” third-order derivatives. However, the 
ensuing treatment of the dissipative fourth-order derivatives is somewhat different, 
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Table 9.12. Centre-line solutions for thermal entry problem, 11 x 11 grid, Ке = 100, Рг--0,70, у =0.5, 
B-10 


x rms 
Scheme 0.000 0.200 0400 0.600 0.800 1.000 1.200 1.400 1.600 1.800 2.000 error 


(semi-exact 0.000 0.493 0.786 0910 0.962 0,984 0,993 0,997 0.999 1.000 1.000 -- 
AF-FDM 0000 0459 0.744 0.881 0945 0.975 0,988 0.995 0998 0.999 1.000 0018 
AF-4PU, 

4=0.5 0.000 0452 0754 0.898 0959 0.984 0.994 0.998 0.999 1.000 1.000 0011 
АҒ-ҒЕМ 0.000 0.462 0.794 0910 0.963 0.984 0,994 0.997 0.999 0999 1.000 0003 





1.8 


ТЕМ 






+ЯР-РОМ 
OAF-4PU 
Ф AF-FEM 
— EXACT 


8.5 





Fig. 9.16. Comparison of centre-line temperature 


distribution 
В.В 1.8 X 2.8 


particularly for large values of the cell Reynolds numbers. Both approaches 
introduce positive dissipation, but for the mass operator scheme the dissipation 
diminishes with increasing cell Reynolds number whereas for the four-point 
upwind scheme it does not. The results shown in Table 9.12 are consistent with 
(9.97). 


9.5.3 Cross-Stream Diffusion 


The use of two-point upwind differencing to represent convective terms introduces 
an artificial diffusivity «’=0.5uAx (Tables 9.1 and 9.3) for one-dimensional prob- 
lems. In multi-dimensions the impact of artificial diffusivity is more usefully 
considered in relation to the local flow direction. Thus in two dimensions the 
streamwise and cross-stream artificial diffusivities are of greater physical interest 
than artificial diffusivities expressed relative to the coordinate directions. 
The two-dimensional steady convection diffusion equation can be written (with 
а= а, = а, and и, v constant) 
_ _ 25 ak 
OT | OT ( T д т)=0 (9.99) 


и-—+0-———@ 


дх ду 


ax? 1 ду? 
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It will be assumed that two-point upwind differencing is used to discretise 
2Т/дх and дТ/ду and that three-point centred differencing is used to discretise 
22Т/дх2 and 42Т/ду?. A Taylor-series expansion of the resulting discrete equation 
indicates consistency to O(Ax?, Ay?) with 


OT oT Т 


дТ ; ; 
—— + аа) у= 


изу dy 0, (9.100) 


where 2; =0.5и4х and о; =0.504у. Equation (9.100) indicates the presence of 
artificial diffusivity associated with each coordinate direction and is clearly a direct 
extension of the one-dimensional result (9.51). 

It is convenient to transform (9.100) based on a cartesian coordinate system, 
tangential to the local flow direction, 


4 5; 952 Ont ‘ — (9.101) 


(Os "sn д? , 


(т 27) T , T DT 


where 
и, — 0.5q(Ax cos*a + Aysin?a) , 
a, = 0.5q(dysina—Axcosa)sin2a , 
a, 2-0.5q(Axcos xsin?x + Aysina cos?a) . 


In (9.101) s and n denote directions tangential and normal to the local flow 
direction, respectively. In addition the uniform flow q makes an angle « with the x 
axis, so that 


u=qcosa and v=qsine. (9.102) 


The first three terms in (9.101) arise from the direct transformation of (9.99). The 
last three terms are artificial diffusion terms. For the case Ах = Ay it is clear that the 
cross-stream artificial diffusivity o7, is a maximum at «= 45? but is zero if the flow is 
aligned with either the x or the y axis. In contrast the stream-wise artificial 
diffusivity is a maximum at «=0 or 90° and falls to a minimum at «=45°. 

At first sight it might appear that streamwise artificial diffusion is of more 
concern since for Ax= Ay, || > |0, | for all х in the range O< 90°. However, for 
most flows, 027/052 is very small; an exception is the region adjacent to a 
stagnation point. In contrast for free shear layers or boundary layers (Chap. 16) 
д2Т/ди? > 02 T/Os? and 22Т/дп? is of comparable magnitude to the convective term 
491/05. Consequently cross-stream artificial diffusion may introduce significant 
errors if a low-order discretisation of the convective terms is utilised and the local 
flow direction does not coincide with the coordinate lines. Conversely methods that 
add purely streamwise artificial diffusion, e.g. Brooks and Hughes (1982), are 
effective in producing oscillation-free accurate solution for many types of flow 
problems. 
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Cross-stream artificial diffusion is examined theoretically and numerically by 
de Vahl Davis and Mallinson (1976), Raithby (1976) and Griffiths and Mitchell 
(1979). 


9.6 Closure 


The various computational techniques introduced in conjunction with the dif- 
fusion equation in Chap. 7 have been applied to a sequence of equations: a linear 
convection equation, steady convection diffusion equation and transport equation. 
This sequence can be interpreted as models, of varying degrees of complexity, for 
the equations (Chap. 11) governing flow problems. Related techniques will be 
applied to the fluid dynamic governing equations in Chaps. 14-18. 

The examination of the model equations has drawn attention to the need to 
recognise and control numerical dissipation and dispersion. This is particularly 
troublesome when first-order algebraic representations are used with diffusion 
either absent or of a much smaller magnitude than convection. The second 
situation, of convection dominating diffusion, also permits non-physical oscil- 
lations to occur if symmetric three-point algebraic formulae are used to discretise 
the convective terms on a coarse grid. However, the introduction of mass oper- 
ators, which are a feature of the finite element method, provide a mechanism for 
exercising some control over dispersion-like oscillations. Through the splitting 
constructions, introduced in Sect. 8.3, mass operators are available in steady and 
unsteady multidimensional problems. 

Ап alternative means of controlling dispersion-like oscillations is provided by 
the introduction of a four-point upwind discretisation of the convective terms. This 
technique is effective although it may introduce additional higher-order dissipation. 
The four-point upwind discretisation can be incorporated into implicit algorithms 
but typically produces a quadridiagonal system of equations that can be solved 
by the Thomas algorithm after first reducing the system to tridiagonal form. The 
use of a four-point scheme also introduces the possible need of additional 
boundary conditions at inflow boundaries or the local reduction to a three-point 
scheme. 

Comparison of the methods of analysis used in this chapter emphasises the 
cost-effectiveness of the modified equation approach (Sect. 9.2.2) for constructing 
improved algorithms. Because of the underlying Taylor series expansion this 
approach only provides information about the long wavelength behaviour. 

The short wavelength behaviour requires a Fourier analysis (Sect. 9.2.1) which, 
in practice, also provides a von Neumann stability analysis (Sect. 4.3). However, 
although the modified equation approach extends directly to nonlinear schemes 
and equations, the Fourier analysis requires a local linearisation which may detract 
from its effectiveness for nonlinear flow problems (Chaps. 10, 11 and 14-18). 
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9.7 Problems 


1-D Linear Convection Equation (Sect. 9.1) 


9.1 
9.2 


9.3 


Obtain the expression for the truncation error shown in Table 9.1 for the Lax- 
Wendroff scheme. 

Apply the von Neumann stability analysis to the Crank-Nicolson mass 
operator form (9.22) and confirm that there is no stability restriction. 
Apply program TRAN to the convecting sine wave problem (Sect. 9.1.5) using 
the Crank-Nicolson mass operator scheme and compare solutions with those 
shown in Figs. 9.2-4. 


Numerical Dissipation and Dispersion (Sect. 9.2) 


9.4 


95 


9.6 


Apply the Fourier analysis, Sect.9.2.1, to the one-dimensional convection 
equation for the following schemes: 

(i) Upwind difference scheme, 

(ii) Lax-Wendroff scheme, 
(ш) Crank-Nicolson finite difference scheme, 
with C=0.8 and 4x=0.5. Obtain results equivalent to Table 9.9; it is 
recommended that a computer program be written to do this. Analyze the 
solutions shown in Figs. 9,2—4 in relation to the Fourier analysis. 
Apply the modified equation approach to: 

(i) Upwind difference scheme, 

(ii) Lax-Wendroff scheme, 
(iii) Crank-Nicolson finite difference scheme, 
for the one-dimensional convection equation and confirm the results given in 
Table 9.1. Compare the predicted behaviour with that of the Fourier analysis 
of Problem 9.4 and the results shown in Figs. 9.2-4. 
Construct a generalised, three-level discretisation, equivalent to (8.26), of the 
one-dimensional convection equation. By considering the truncation error 
determine if there are optimal choices of у and В for minimising dispersion and 
dissipation errors. Modify program TRAN to run this case and test on the 
convecting sine-wave problem. 


Steady Convection-Diffusion Equation (Sect. 9.3) 


9.7 


9.8 


9.9 


Incorporate the centred difference scheme (9.45) and the upwind scheme (9.50) 
into a computer program and confirm the results shown in Figs. 9.5 and 9.6. 
Implement the program developed in Problem 9.7 for u/a=5 and 
Ax = 0.1 with a Neumann boundary condition, dT/dx = g at x = 1.0. Choose g 
to satisfy Eq. (9.44). Compare the solution with a corresponding Dirichlet 
boundary condition case. 

Introduce symmetric five-point schemes for 4Т/4х and d^T/dx?, by mini- 
mising the truncation error with one adjustable parameter left in each 
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expression. Apply this scheme to the convection diffusion equation with 
Dirichlet boundary conditions (9.43). Use the exact solution (9.44) to supply 
the additional boundary conditions required. Modify subroutines BANFAC 
and BANSOL to solve the resulting pentadiagonal matrix. Obtain solutions 
to compare with those given in Table 9.2 and determine whether specific 
choices of the two adjustable parameters will produce more accurate, but 
stable, solutions. 


1-D Transport Equation (Sect. 9.4) 


9.10 Apply the modified equation analysis to the Lax-Wendroff and Crank- 
Nicolson discretisations of the one-dimensional transport equation and con- 
firm the results given in Table 9.3. 

9.11 For the explicit four-point upwind scheme, i.e. (9.71) with Т” replacing 0.5(7" 
+ T? * !) in the spatial operator, determine an optimal value of q to reduce the 
size of the truncation error. Apply the von Neumann stability analysis to 
determine any stability limits. Run program TRAN with the optimum 4, if 
stable, for conditions corresponding to Table 9.6 and compare results. 

9.12 Modify program TRAN to implement the three-level fully implicit scheme 
(ТаЫе 9.3). Obtain solutions to the propagating temperature front problem 
(Sect. 9.4.3) for increasing cell Reynolds numbers and determine empirically 
the limit for oscillation-free solutions. Repeat for the Crank-Nicolson finite 
difference and finite element schemes. 


2-D Transport Equation (Sect. 9.5) 


9.13 By considering the truncation error of the Crank-Nicolson mass operator 
formulation of the two-dimensional transport equation (9.81) determine 
optimum values of 6, and 9, in (9.86) to minimise dispersion errors. Apply а 
von Neumann analysis to (9.88 and 89) with y 2 0 and f = 0.5 to establish 
whether the optimal values will produce stable solutions. 

9.14 For program THERM compare AF-FDM; AF-4PU (4= 0.5) and AF-FEM 
for the number of iterations to reach convergence. Do this for the three 
marching algorithms, (a) y =0, B 20.5, (b) у=0, В=1.0, (c) у 20.5, В = 1.0, and 
for grids, 11 x 11 and 21 x 21. 

9.15 Based on the results in Table 9.12, Problem 9.14 and any additional solutions 
from THERM compare the computational efficiency of AF-FDM, AF-4PU 
and AF-FEM. 


10. Nonlinear Convection-Dominated Problems 


In Chap.9 linear convection-dominated problems are considered. It is demon- 
strated there that care is required in discretising, particularly the convective terms, 
to avoid introducing spurious higher-order derivatives (in the modified equation 
sense) that may be of comparable magnitude to physically important terms in the 
governing equation. This is a particular problem with two-point upwind dif- 
ferencing. | 

The identification of higher-order derivatives in the modified equation 
(Sect. 9.2.2) with the processes of dissipation and dispersion allows more accurate 
computational algorithms to be developed. For example, a four-point upwind 
scheme avoids excessive dispersion-related oscillations (Sects. 9.3.2 and 9.4.3) with- 
out introducing unacceptable dissipation, as is the case with the two-point upwind 
scheme. 

For the energy equation (Sect. 11.2.4) the convective terms are linear if the 
velocity field is known. However the convective terms in the momentum equations 
(Sects. 11.2.2 and 11.2.3) are nonlinear. In this chapter the additional complications, 
associated with nonlinear convection, are considered. The one-dimensional 
Burgers’ equation is a suitable model for this purpose since it possesses the same 
form of convective nonlinearity as the incompressible Navier-Stokes equations 
(Sect. 11.5) and has readily evaluated exact solutions for many combinations of 
initial and boundary conditions. 

The convective nonlinearity must be be suppressed i in applying the von Neumann 
stability analysis. This is done by treating solution-dependent coefficients multi- 
plying derivatives as being temporarily frozen. The modified equation approach to 
analysing nonlinear computational algorithms is applicable (Klopfer and McRae 
1983) but the appearance of products of higher-order derivatives makes the 
construction of more accurate schemes less precise (Sect. 10.1.4) than was the case 
for linear equations (Sect. 9.4.3). 

The nonlinear nature of the convective terms permits severe gradients in the 
dependent variables to develop. This suggests sts the use ofa п nonuniform rm grid locally 
refined in the vicinity of the зе rapid soluti solution change. The influence of a nonuniform 
grid on the truncation error error and solution accuracy are examined briefly in 
Sect. 10.1.5. This subject is treated at greater length in Chap. 12. 

The governing equations for fluid dynamics (Chap.11) usually occur as 
systems, as well as being nonlinear. The additional complications of constructing 


_computational algorithms for systems of nonlinear equations are considered, ina 


meets ae e a a o —— 


preliminary fashion, in Sect. 10. 2. 
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The Galerkin finite element method (Chap. 5) produces accurate and 
economical algorithms for linear convection-dominated problems (Sects. 9.1.4, 
9.4.3 and 9.5). However, the treatment of convective nonlinearities is potentially 
uneconomical in multidimensions or if higher-order interpolation is used. The lack 
of economy can be overcome by adopting a group finite element formulation; this is 
described in Sect. 10.3. 

The split schemes, developed in Sects. 8.2, 8.3 and 9.5, are extended to the two- 
dimensional Burgers’ equations in Sect. 10.4.2. The two-dimensional Burgers’ 
equations, like the one-dimensional form, have a readily available exact solution, at 
least in the steady-state limit, thereby permiting direct evaluation of the accuracy of 
specific computational algorithms (Sect. 10.4.3). 


10.1 One-Dimensional Burgers’ Equation 


In one dimension, Burgers’ equation (Burgers 1948) is 
—Ttül—v—-0. (10.1) 
x 


Burgers equation is similar to the transport equation (9.56) except that the 
convective term is now nonlinear. This is the same form of nonlinearity as appears 
in the momentum equations, Sects 11.2.2 and 11.2.3. To emphasise the connection 
with the momentum equations, the diffusivity « in (9.56) has been replaced with the 
kinematic viscosity v in (10.1). Further discussion of Burgers' equation is provided 
by Fletcher (1983a) and Whitham (1974). 


10.1.1 Physical Behaviour 


If the ‘viscous’ term is dropped from (10.1) the result is the inviscid Burgers’ 
equation 


—+й—=0, | (10.2) 
х 


which тау be compared with (9.2). The nonlinearity іп (10.2) allows discontinuous 
solutions to develop. The way that this can occur is 1 occur is illustrated schematically in 


Fig. 10. ТА wave is convecting from left to right and solutions for successive times 
t=0, t,, t; are indicated. Points on the wave with larger values | of à convect faster 


and consequently ‹ overtake parts ¢ of the wave convecting w with si ith smaller _ values of и. 
For (10.2) to have a unique solution (and a physically sensible resuli result) it is necessary 


to ) postulate a shock (ab in Fig. 10.1) across_ which. ü changes discc discontinuously. 


л ——— — 


The comparable wave development | for the * viscous’ us Burgers” equation (10.1) is 
shown in Fig. 10. 10.2. The effect of the viscous ferm дд is twofold. First, it 


reduces tlie amplitude of the wave for increasing t(j just as for the transport 
ы аши жайылы лиса... ж-ты ею — 


н а ем з ———— —- 
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tt 


> Fig. 10.1. Formation of a multi- 


valued solution of the inviscid 
Burgers’ equation (10.2) 





Fig. 10.2. Evolution of the sol- 
ution of Burgers’ equation (10,1) 


equation). Second, it prevents multivalued solutions, as in Fig. 10.1, from de- 


veloping. | - 

The above features make Burgers’ equation a very suitable model for testing 
computational algorithms for flows where severe gradients or shocks are antici- 
pated. This role as a test-bed for computational algorithms is facilitated by the 
Cole-Hopf transformation (Cole 1951; Hopf 1950) which allows exact solutions of 
Burgers’ equation to be obtained for many combinations of initial and boundary 
conditions. Examples of this use of Burgers’ equation are provided by Fletcher 
(1983a). In Sect. 10.1.4 Burgers’ equation is used to demonstrate various compu- 
tational algorithms for convection-dominated flows. 


- There is an alternative way of handling the nonlinear convective term. This 
requires rewriting (10.1) in conservation or divergence form, 
ded ча MEE ose. 


(10.3) 





where F —0.547. The appearance of the groups of terms, like F, arises naturally in 
the derivation of the equations governing fluid flow (Sect. 11.2.1) and is often 
directly modelled in computational algorithms, particularly for compressible flow 
(Sects. 14.2 and 18.3). The same idea is used in the group finite element formulation 
(Sect. 10.3). 

The appearance of the nonlinear convective term in (10.2) also permits aliasing 
to occur. If an initial condition й(х, 0), associated with (10.2), is decomposed into its 
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Fourier components, as was done to generate (9.30), the subsequent evolution of 
the solution, which is governed by (10.2), will generate a rapid growth in the 
spectrum of wavelengths with significant amplitudes, through the product of the 
Fourier components implied by the nonlinear term 404/4х. 

When a solution is obtained on a grid of spacing 4x the smallest wavelength 
that can be resolved is 24x. The energy associated with wavelengths shorter than 
24x reappears associated with long wavelengths. This phenomenon is called 
aliasing (Hamming 1973, р. 505). Unfortunately the aliased shortwave contri- 
bution to the solution distorts the true longwave solution and may even cause 
instability (referred to as nonlinear instability) where very long time integrations 
are made, as in computational weather forecasting. 

It may be recalled (Sect. 9.2) that if any dissipation, physical or computational, 
is present it attenuates the amplitude of the short wavelengths very significantly; in 
this case the errors introduced by aliasing are minimal. This is the situation for 
most engineering flow problems, which possess physical dissipation, typically. 
However, many classes of geophysical fluid motion are characterised by the 
convection of wave-trains with little or no physical dissipation. Then aliasing, and 
nonlinear instability, can be a problem. 


10.1.2 Explicit Schemes 
An FTCS finite difference representation of (10.1) is 


dj ie i-a) viuj- 2u tuje) 


4 3 
at + 24х Ax? 


0. (10.4) 
The contribution uj to the convective term is the local solution at the node ( j, n). 
Since this is available directly, the various schemes (e.g. DuFort-Frankel, etc.), 
described for the linear transport equation (Sect.9.4), can also be applied to 
Burgers' equation. 

The concepts of truncation error and the modified equation are also relevant to 
nonlinear equations like Burgers’ equation. In applying the von Neumann stability 
analysis the contribution uj to the convective term is temporarily frozen, since the 
stability analysis is strictly only applicable to linear equations. Aithough the 
application of the von Neumann analysis to nonlinear equations cannot be 
rigorously justified it is found to be effective in practice. 

Ап obvious FTCS representation of (10.3), equivalent to (10.4), is 


wj Бу Ер о у(иу 2+1) _ 


4 2Ax Ax? 9. (10.5) 


The algorithm formed from (10.5) is implemented in program BURG (Fig. 10.4) 
when ME=1. 
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_A potentially more accurate treatment of the convective term is provided by a 
four-point 1r-point upwind discretisation. This was applied to the equivalent linear convec- 
tive term in Sects. 9.3.2 and 9.4.3. The four-point upwind discretisation of ĝF/ðx is 
introduced by replacing (F},,—Fj-,)/24x in (10.5) with ДР where, for и 
positive, 





Fj41— Fj-i 4 (5-2 3Fj- + 3F,7 Fj) 


LO F= . 
* 24x 3Ax (196) 
If u is negative (10.6) is replaced by 
Ғаі-Е;- (F;-, -—3F;+3F;,,—Fj+2) 
4) pe іні і-і 4 і-і 1 1+1 1+2 
ГЕ Mx + o (10.7) 


Equations (10.6 and 7) have truncation errors of O(Ax?) for any choice of q, 
except g=0.5 when they are of O(Ax?). For the propagating shock problem 
considered in Sect. 10.1.4, the explicit four-point upwind scheme (10.5, 6) is imple- 
mented in program BURG (Fig. 10.4) when МЕ=З. 

The equations that govern unsteady one-dimensional inviscid shock flows 
(Sects. 10.2 and 14.2) are similar i in structure to the conservation form of the inviscid 


Burgers’ equation, Le. 
дй OF 

‚. —+—=0, 10.8 
ot Ox (10.8) 


where F—0.5i?. Previously the Lax-Wendroff scheme (9.16) was applied to the 

linear counterpart, i.e. (9.2), of (10.8). Since F is nonlinear the replacement of 

Ou/Ot^ with an equivalent spatial derivative is more complicated than in 

Sect. 9.1.3. Thus from (10.8) 
д?й F д (OF 4 OF 01 _ дЕ 
0t? — xot | 2(%) Ot 


where А--ӘЕ/дй. For Burgers’ equation, А--й. Making use of the above relations 
gives 


д?й ð дЕ 


With a suitable discretised form for д(АӘЕ/дх)/дх, the Lax-Wendroff scheme, 
applied to (10.8), is 


ut ED: -052 AI -F- osos (S газа Fj) 


—A,-,(F?—F?_,)], (10.10) 
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where, for Burgers’ equation, A,,,=4;+,=0.5(u;+u,;,,), etc. Equation (10.10) has 
a truncation error of O(At?, Ax?) and is stable if |u,,, At/Ax| € 1.0. 

Because of the explicit appearance of the Jacobian A at the half steps, j —1 and 
j+4, (10.10) is computationally less economical than the equivalent algorithm 
(9.16) for the linear convection equation. A more economical evaluation is obtained 
by replacing (10.10) with an equivalent two-stage algorithm 


At 
uj. 7 0.5(0) uj 1) 0.5 (Е-Е) and (10.11) 


At 
uj ART FE . (10.12) 
Equation (10.11) generates an intermediate solution uf, ,, nominally at (n+4)4t, 
which allows F7. ,, etc., in (10.12) to be evaluated. The two-stage scheme (10.11, 12) 
is equivalent to (9.16) if F =u. For ease of coding, values of F at j +} are stored in 
F(j). This is done in program BURG (Fig. 10.4) for option ME —2. 

Historically (Richtmyer and Morton 1967) the two-stage algorithm (10.11, 12) 
has been successful for predicting inviscid compressible flow behaviour. Attempts 
to extend it to viscous flow, e.g. Thommen (1966), have been disadvantaged by the 
reduction in formal accuracy to O(At, Ах2). This gives accurate steady-state results 
but the time step is restricted by the CFL stability condition. For transient 
problems the time step must be restricted further to achieve acceptable accuracy. 
Thommen's extension of (10.11, 12) to discretise (10.3) can be written 


| At 
uf. 7055 511) 0.5 (Е-Е) 
T 0.5s[0.5(u5.. , — 265 +и?, 1) c 05(uj —2u5,, +и7+2)] › (10.13) 
4 
ини UTR CFI + 851 — 20) ui) К (10.14) 


where s=vdAt/Ax*. This scheme is implemented in program BURG (МЕ = 2). 
An examination of (10.13) indicates that four grid points are involved in evaluating 
the viscous term; this implies additional boundary conditions to evaluate u;, >. 
Equations (10.13 and 14) produce stable solutions if 


At(A*At+2v)<S Ax? . (10.15) 


Peyret and Taylor (1983, p. 56) indicate that a practical explicit criterion for a 
stable solution is 4t € Ax?/(2v -- | A| Ax). 

Equations (10.13 and 14) can be interpreted as one member of the 5% , family 
introduced by Lerat and Peyret (1975). This family of methods is discussed by 
Peyret and Taylor (1983, pp. 53-57). The related 55 family for the inviscid Burgers’ 
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equation is discussed in Sect. 14.2.2. The parameter y determines the proportional 
evaluation of the viscous term, іп (10.13) centred at grid points j and j +1. For the 
Thommen scheme, «= f — y =0.5. 

A general classification of difference schemes for the inviscid Burgers' equations 
(10.2) that includes the 5% family is provided by Yanenko et al. (1983). The 
classification also includes implicit schemes and covers the one-dimensional un- 
steady Euler equations (10.40, 41). Yanenko et al. describe and reference many 
schemes of Soviet origin that are not well-known in the Western literature. 


10.1.3 Implicit Schemes 

Applying implicit schemes to nonlinear equations such as Burgers' equation is not 
quite as straightforward as for linear е uations. Chap. 9). A Crank-Nicolson 
implicit formulation of (10.3) is 


Дит! 
At 





= — 0.5L (F! - F5*!) + 0.5 „(ии ил? 1), (10.16) 


where и" +1 = ити", Le=(— 1,0, 1)/24x and L,,—(1, —2, 1)/Ах2. То make 
use of the very efficient Thomas algorithm (Sect. 6.2.2) it is necessary to reduce 
(10.16) to a system of linear tridiagonal equations for the solution :*!. The 


appearance of the nonlinear implicit term F5*! poses a problem. 


However, this problem can be overcome in the same way that was used in 
introducing the solution correction term 477*! with the split schemes (Sect. 8.2). 
That is, a Taylor series expansion of F5*! is made about the nth time-level. Thus 


2 п 
Fitt= jal FT +0. ғара =|, +... OF 
at |; д 
Fut! = Е" АДи"*1 +0(4#?) , 


where A—[0F/0u]; =u; for (10.3). Therefore from (10.16), the following tridiag- 
onal aigorithm can be constructed: 


A п+1 
— = OSL орев") vL] ей) or (017 
uti OSAL Au ut) vLex t ри" 0 vAtL, aS. (10.18) 


In forming (10.18) it is clear that Г.Е? has cancelled out. In tridiagonal form (10.18) 
can be written as 


а р тит =, (10.19) 
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where aj = —025(At/ Ax. , — 0.55, 


bj=1+s, 
At 
cj 0.25 uj +1 7 058 , 
våt 
dj—0.5515.., +(1— su; +0.55и7., and s= . 


It is clear that the coefficients а) and ст are functions for и" and so must be 


reevaluated at every time step. However the form of (10.19) permits direct use of the 
Thomas algorithm (Sect. 6.2.2). TT 
The implicit scheme (10.18) has a truncation error of O(At?, Ax?) and is 
unconditionally stable in the von Neumann sense, The linearised Crank-Nicolson 
scheme (10.17) can be extended to include the mass-operator form and the four- 
oint upwind discretisation of the convective term (Sect. 9.4.3). Following (0.69 and 
71), a generalised Crank-Nicolson discretisation of (10.3) is written 


A ntl 
м( 1. Jom -OSLPOF да азн H) , (10.20) 











where M, = {6, 1 — 26, ô} and ДЕ is given by (10.6) for и positive. The truncation 
error for (10.20) is O(At’, Ax’). In quadridiagonal form (10.20) сап be reduced to 


енліі- қал т + bust с =, (10.21) 
where 
444, 
a6 ax?” 
At 
aj = — (0.25 +0.54) ut, +5—0.5s , 
Ах 
п At п 
Б-14054:4-20-5 > 


n q\4t , 
(025-15-05 , 


dj =(ô +0.55)ит_, +(1—26—s)uj c (6 --0.55)и , . 


Тһе quadridiagonal system of equations (10.21) сап be solved using the generalised 
Thomas algorithm, Sect. 6.2.4. In practice, the quadridiagonal system (10,21) is 
reduced to tridiagonal form (lines 134-144 in Fig. 10.4) and subsequently sub- 
routines BANFAC and BANSOL are applied in the conventional manner. The 


101 One-Dimensional Burgers’ Equation 339 


generalised Crank-Nicolson algorithm (10.21) is implemented іп program 
BURG when ME=4 or greater. 
Stable solutions of (10.21) are obtained if 6<0.25 and 420. As might be 


expected from Sect. 9.4.3, specific choices of 6 and q will reduce dispersion-related 
oscillations. This is pursued in Sect. 10.1.4. 


10.1.4 BURG: Numerical Comparison 


In this subsection the various explicit (Sect. 10.1.2) and implicit (Sect. 10.1.3) 
schemes will be applied to the propagation of a shock wave governed by the 
“viscous” Burgers’ equation (10.3). At t=0 the shock wave is located at x=0. The 
initial conditions are 


ио(х) = #(х,0)=1.0 for -х,,%5х<0, (10.22) 
ug(x) = й(х, 0)=0 or 0<х<х,,. 

The following boundary conditions are applied at x= + Xmax: 
U — Xman {) = 1.0, (ха, t)=0 . (10.23) 


For this combination of initial and boundary conditions, (10.3) has an exact 
solution given by 


i= iu [(x— суета | | f ectteegg , (10.24) 


where 


G(5 х,)= | Т 492065) 2 


and Re=! . 
y 


Equation (10.24) is plotted in Fig. 10.3 for two values of Re. In using (10.24) as the 


О 





Fig. 10.3. Exact solution 
of Burgers’ equation 





10 
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1С 
26 BURG SOLVES BURGERS EQUATION FOR A PROPAGATING 'SHDCK*' 
3c 
4 DIMENSION R(65),U(65),UD(65) ,UE(65) , A (5,65) 
5 1,EF(4),X(65) , AA(2), BB(2) ,CC(2) 
6 OPEN(1,FILEs'BURG.DAT') 
7 OPEN (6, FILE= BURG. OUT’) 
8 READ(1,1)ME, JMAX, NTIM,XMAX, AD 
9 READ(1,2)€,S,Q,EM 
10 1 FORMAT(3IS,2E10.3) 
11 2 FORMAT(FS.2,3E10. 3) 
12 С 
13 JMAF = JMAX - 2 
14 ЈМАР = МАХ - 1 
15 AMP = ОМАР 
16 DX = 2.*XMAX/AMP 
17 DT = сарх 
18 SA = AD*CeC 
19 ALPH = S*DX*DX/DT 
20 IF(ALPH .LT. 1.0E-10)ALPH = 1.0E-10 
21 ЕСЕП, = DX/ALPR 
22 QQ = Q*C/3. 
23 IF(ME 117. 3000 = 0. 
2% ий = 0 
25 IF(ABS(QQ) .GT. 0.0001)MQ = 1 
26 C 
27 АТІМ = ИТІМ 
28 ТІМ = 0, 
29 TIMAX = ОТзАТТИ 
30 С 
31 WRITE(6 ,3) 
32 3 FORMAT(' PROPAGATING SHOCK WAVE(BURGERS EQUATION) *) 
33 IF(ME .EQ. 1)WRITE(6,4)ME 
34 IF(ME .Ей. 2)WRITE(6,5)ME 
35 IF(ME .EQ. 3)WRITE(6,6)ME 
36 IF(ME .GE. 4)WRITE(6,7)ME 
37 4 FORMAT(SH ME -,12,31,18H FTCS DIFFERENCING) 
38 5 FORMAT(SH ME z,12,3X,13H LAX-WENDROFF) 
38 6 FORMAT(SH ME -,12,3X,20H EXPLICIT 4PT UPWIND) 
40 7 FORMAT(SH ME -,I2,3X,23H GENERAL CRANK-NICOLSON) 
41 WRITE(6,8) МАХ, ITIM,C,DX,DT, XMAX 
42 8 FORMAT(’? JMAX-',I3,' NTIM-',I3,' C-',F5.2, 
43 1' Dx=’,F5.3,' DT-',F5.3,! XMAX-!,F5.2) 
44 WRITE(6,9)S,ALPH,RCEL,Q,EM,SA 
45 9 FORMAT(’ Sz!',F5.2,'  ALPHz',E10.3,' RCEL=’,F6.1,’ Q=’,FS.2, 
46 1! EM=’,F6.3,’ SA-',E10.3) 
47 C 
48 IF(ME .GT. 3)GOTO 10 
49 АА(1) = 0.5%С + 3,.*QQ 
50 BB(1) =  - 3.*QQ 
51 CC(1) = -0.5*C + QQ 
52 IF(ME .NE. 2)60TO 11 
53 AMC1) = 0. 
54 BB(1) = 0.5*С 
55 СС(1) = -0.5*C 
56 АА(2) = С 
57 ВВ(2) = -Ç 
58 ce(2) = 0. Fig. 10.4. Listing of program BURG 


exact solution it is necessary to limit the time and the smallest value of Re so that 
the boundary conditions (10.23) are consistent with the exact solution. 

The various schemes described in Sects. 10.1.2 and 10.1.3 are implemented in 
program BURG (Fig. 10.4) according to the value of the parameter ME, as in 
Table 10.1. The choice 6=1/6, q=0 produces the Crank-Nicolson, group finite 
element scheme. The group finite element formulation (Sect. 10.3) is necessary to 
discretise the dependent variable F explicitly. For solutions of (10.3) with small 


с 


13 
14 


15 


16 
17 


18 


19 


20 


101 One-Dimensional Burgers’ Equation 


GOTO 11 
АА(1) = EM + O.5*S 
BB(1) = 1.0 - 2.0*EM - S 

CC(1) = EM + 0,5#5 

WVRITE(6, 12)AA,BB,CC 

FORMAT(' AA-',2F8.5,' BB-/,2F8.5,' c=’, 2F8.5,/) 


INITIALISE U AND EVALUATE UEX 
CALL EXSH(JMAX,X,U,UE,DX, XMAX. ALPB, TIMAX) 


UD(1) = U(1) 

UD(JMAX) = UCJMAX) 

DO 14 J = 1, МАХ 

DO 13 K = 1,5 

АСК, J) = 0. 

CONTINUE 

WRITE(6,15)TIM 

FORMAT(^ INITIAL SOLUTION, ТІМ =’,F5.3) 
WRITE(6,18)(X(J), J=1,IMAX) 
WRITE(6,17) (U(J), J=1, МАХ) 
FORMAT(' = X=',12F6.3) 
FORMAT(? U=? ,12F6.3) 


MARCH SOLUTION ІН TIME 


ро 27 И = 1,HTIM 
ТР(МЕ .GT. 3)GOTO 21 


EXPLICIT SCHEMES 


ІР = 1 
IF(IP .EQ. 1)EF(1) = 0.5eU(1}#U(1) 
ІР(ІР .Ей. 2)EF(1) = 0.5*UD(1)*UD(1) 
EF(2) = EF(1) 
EF (3) EF(2) 
IF(IP .EQ. 1)EF(4) 
IF(IP .EQ. 2)EF(4) 
DO 20 J = 2, МАР 
IF(ME .EQ. 3)ЕР(1) 
EF(2) = EF(3) 

= EF(4) 
IFC(IP .EQ. 1)EF(4) = 0.5*U(J*1)*U(J41) 
IF(IP .Ей. 2)EF(4) = 0,5*UD( J*1)*UD( ]41) 
DUM = S*(U(J-1) - 2.*U(J) + U(J41)) 
IF(ME .NE. 2 .OR. IP .Ей. 2)60Т0 19 
JP = J+i 
IF(J .EQ. JMAP)JP = J 
DUM = 0.5*DUM*0,5*S*(U(J)-2.*U(J41)4U(JP*1) ) 40. 5* (U( JP)-UC J)) 
UD(J) = UCJ) + AACIP)*EFC2) + BBCIP)*EF(3) + CCCIP)*EF(4) + DUM 
IF(ME .EQ. 3)UD(J) = UD(J) - QQ*EF(1) 
CONTINUE 
IF(ME .NE. 2 .OR. IP .Ей. 2)60TO 25 
ІР = IP + 1 


0.5*U(2)*U(2) 
0. 5*UD(2) *UDC2) 


EF(2) 
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бото 18 Fig. 10.4. (cont) Listing of program BURG 


values of the viscosity v it is desirable to add some artificial dissipation to the right- 
hand side of (10.16). The following expression is added: 


After linearising L,,F7*!, the modified form of (10.20) is implemented as 


Ov, ACL, (ЕҢ F5*3) . 


М,и"*! +0.54¢(LO(utut*!)—vL, us! у AL (ити !)] 


= Mut. SvALL, MD . 


(10.25) 
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115 6 TRIDIAGONAL SYSTEM FOR IMPLICIT SCHEMES 


117 21 IF(MQ .EQ. 1)DIM = U(1) 





118 ро 22 J = 2,JMAP 

119 М = 1-1 

120 IF(JM .НЕ. 1)DIM = U(JM-1) 

121 A(1,JM) = 0.5*QQ*DIM 

122 А(2,1М) = EM - 0.5%5 - (0.25*C + 1.5*QQ)*UCJM) 
123 A(3,JM) = 1.0 - 2.0*EM + S + 1.5*Q0Q*U(J) 
124 A(4,JM) = EM - 0.5%5 + (0.25*C - O,5*QQ)*UCI41) 
125 IF(ME „ИЕ. 5 )GOTO 22 

126 AC2, JM) = A(2,JM) - Q.5*SA*UCJIM) 
127 A(G,JM) = AC3,JM) + БАЯУ) 

128 A(4,JM) = A(4,JM) - O.5*SA*U(I*1) 
129 22 R(JM) = AACI)SUCIM) + BB(1)*U(J) + CC(1)5UCIS1) 
130 R(1) = R(1) ~ A(2,1)*U(1) 

131 IF(MQ .EQ. O)GOTO 24 

132 R(1) = RCL) - А(1,1)*0С1) 

133 R(2) = R(2) - A(1,2)*UC1) 

134 С 

135 С REDUCE А TO TRIDIAGONAL FORM 

136 С 

137 DO 23 JM - 3,JMAF 

138 JMK = JM - 1 

139 DUM = 4(1,JM) /A(2,JMM) 

140 A(2,JM) = A(2,JM) - A(3,JMM)*DUM 
141 AC3,JM) = A(3,JM) - АСА, IMM) *DUM 
142 A(1,JM) = 0, 

143 R(JM) = R(JM) ~ RCJMM)*DUM 

144 23 CONTINUE 

145 AC1,1) = 0. 

146 A(1,2) =0, 

147 24 А(2,1) = 0. 

148 А(4, МАР) = 9. 

149 С 

150 CALL BANFAC(A, JMAF, 1) 

151 CALL BANSOL(R,UD,A, JMAF, 1) 

152 С 

153 25 DO 26 J = 2,JMAP 

154 IF(ME .LE. 3)U(J) = UD(J) 

155 IF(ME .GT. 3)U(J) = UDCJ-1) 

156 26 CONTINUE 

157 27 CONTINUE 

158 C 

159 WRITE(6 , 28) TIMAX 

160 28 FORMAT(’ FINAL SOLUTION, TIM =’ ,FS.3) 
161 WRITE (6,16) (X(J), Jz1,JMAX) 

162 WRITE(6, 17) (UCI) ,J=1, МАХ) 

163 VRITE(6,29) (UE(J) , J=1, JMAX) 

164 29 FORMAT(' UE=',12F6.3) 

165 SUM = 0, 

166 ро 30 J = 2, МАР 

167 30 SUM = SUM + (U(J) ~ UE(J))##2 
168 RMS = SQRT(SUM/(AMP-1.)) 

169 WRITE(6,31)RMS 

170 31 FORMAT(’ RMS ERR=’ ,E12.5) 

171 STOP 

172 END 


Fig. 10.4. (cont.) Listing of program BURG 
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Table 10.1. Various schemes implemented in program BURG 
ME Description 


FTCS scheme (10.5) 

two-stage Lax-Wendroff scheme (10.13, 14) 

Explicit four-point upwind scheme (10.5, 6) 

Crank-Nicolson, fin. diff.; 6=0, а=0 in (10,20); CN-FDM 
Crank-Nicolson, fin. elem.: ó—1/6, q=0 in (10.20; CN-FEM 
Crank-Nicolson, mass oper.; $ :=0.12, 4==0 in (10.20; CN-MO 
Crank-Nicolson, 4pt. upwind; 6=0, 4:-0.5 іп (10.20; CN-4PU 
generalised Crank-Nicolson plus additional dissipation 


UA Aa & & шо го к= 
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The parameter у, is chosen empirically. Equation (10.25) is implemented when 


МЕ= 5. 


Тһе various parameters used in program BURG are described in Table 10.2. 
The exact solution (10.24) is evaluated in subroutines EXSH (Fig. 10.5) and ERFC 
(Fig. 10.6). Typical output produced by program BURG for the FTCS scheme is 


shown in Fig. 10.7. 


Table 10.2, Parameters used in program BURG 





Parameter Description 





ME ==1, FTCS scheme (10.5) 
== 2, two-stage Lax-Wendroff scheme (10.13, 14) 
== 3, explicit four-point upwind scheme (10.5, 6) 
—4, generalised Crank-Nicolson (10.20) 
= 5, generalised Crank-Nicolson plus additional dissipation 


МАХ number of points in the interval, —x,,, S X $ Xmax 
NTIM number of time steps 

TIMAX time of comparison with exact solution 
XMAX Xmax in (10.22) 

AD artificial dissipation, v, in (10.25) 

C reference Courant number, C = u(1)4t/ Ах 
ALPH viscosity, v in (10.3) 

S, ЗА з= vAt/ 4х2, s,:- v,C? (Table 10.5) 

RCEL reference cell Reynolds number, u(1)4x/v 
Q q in (10.6) 

EM $ in (10.20) 

DT, ОХ, Х 4t, 4х, x 

U dependent variable, и in (10.3) 

EF F in (10.3) 

UE exact solution, й in (10.24) 


AA, BB, CC coefficients multiplying F5. ;, etc., for explicit schemes 
coefficients multiplying и]. у, etc. for implicit schemes 


A elements in quadridiagonal matrix (10.21) 
R vector containing d7 in (10.21) 
UD intermediate explicit solution; first-stage Lax- Wendroff 


UD Гоги" *! on return from BANSOL 
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1 
2 SUBROUTINE EXSH(JMAX,X,U,UE,DX,XMAX, ALPE, TIMAX) 
3c 
4c SETS THE INITIAL U SOLUTION AND FINAL EXACT (UEX) SOLUTION 
5c 
6 DIMENSION U(65),UE(65),X(65) 
7 JMAP - JMAX - 1 
8 РІ - 3.141592654 
9 XST = -XMAX 
10 DO 1 J = 1,JMAX 
11 АЈ = 1-1 
12 XCJ) = XST + AJ*DX 
13 ШІ = 0 
14 1 UECJ) = 0, 
15 U(1) = 1.0 
16 UE(1) = 1.0 
17 С 
18 ро 2 J = 2,JMAP 
19 IF(X(J) .LT, 0.)0(Ј) = 1.0 
20 IF(ABS(X(J)) .LT. 1.0E-04)U(J) = 0.5 
21 AJ = XCD 
22 ХВ = - AJ 
23 ХА = А] - TIMAX 
24 PQ = SQRT(O.25*PI) 
25 РР = SQRT(PI*TINAX*ALPR) 
26 XB = XB*PQ/PP 
27 ХА = XA*PQ/PP 
28 С 
29 C ERFC CALCULATES THE COMPLEMENTARY ERROR FUNCTION 
30 c 
31 CALL ERFC(XA,EXA) 
32 CALL ERFC(XB,EXB) 
33 С 
34 SUMA = РРеЕХА 
35 SUNB = PP*EIB 
736 DUM = 0.5*(AJ-0. 53TIMAX) /ALPH 
37 IF(DUM .GT, 20.)DUM - 20. 
38 IF(DUM .LT.-20.)DUM --20. 
39 SUMC - EXP(DUM) 
40 UE(J) = SUMA/(SUMA + SUMC#SUMB) 
41 2 CONTINUE 
42 RETURN 
43 END Fig. 10.5. Listing of subroutine EXSH 


SUBROUTINE ERFC(X,ERC) 


1 

2 

3c 

4c COMPLEMENTARY ERROR FUNCTION 

5С 

6 В - АВ5(Х) 

7 IF(B .LT. 4)60TO0 1 

8 р = 1.0 

9 GOTO 2 

10 1 С = ЕЇР(-В*В) 
11 T = 1./(1. + 0,3275911%В) 

12 D = 0.254829592*T - 0.284496736%Т%Т + 1.421413741*TeTeT 
13 1 - 1.453152027%Т%ТжТәТ + 1.061405420eTeTeTeTeT 
14 D= 1, - Dec 
15 2 IF(X ИЛ. 0.)D = -D 
16 ЕАС = t. -D 
17 RETURN 


18 END Fig.10.6. Listing of subroutine ERFC 
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PROPAGATING SHOCK WAVE(BURGERS EQUATION) 

МЕ = 1 FTCS DIFFERENCING 

JMAX= 21 НТЇМ= 20 C= ,25 DX= .200 DT= .050 XMAX= 2.00 

S= „25 ALPHs .200E+00 RCEL= 1.0 Q= .00 EM= .000 SA= .000Е+00 
AA= .12500 .00000 ВВ» .00000 .00000 CC» -.12500 .00000 


INITIAL SOLUTION, ТІН = .000 
Х=-2.000-1.800-1.600-1.400-1.200-1.000 -.800 -.600 -.400 -.200 .000 .200 
X= .400 .500 .800 1.000 1.200 1.400 1.600 1.800 2.000 
Us 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 „500 .000 
Ux .000 .000 .000 .000 .000 .000 .000 .000 .000 
FINAL SOLUTION, ТІН з1.000 
Х=-2.000-1.800-1.600-1.400-1.200-1.000 -.800 -.600 -.400 -.200 .000 .200 
X= .400 .600 .800 1.000 1.200 1.400 1.600 1.800 2.000 
Ux 1.000 1.000 1.000 1.000 1.000 .999 .997 .991 .975 .940 .869 .751 
Ux .588 .407 .247 .132 .063 .027 .010 .003 .000 
UEs 1.000 1.000 1.000 1.000 1.000 .999 .996 .989 .973 .937 .868 .753 
ШЕя .591 .409 .247 .132 .063 .027 .011 .004 .000 
RMS ERR= .14106E-02 


Fig. 10.7. Typical output from program BURG 


For a value of the viscosity, v=0.2, a representative cell Reynolds number is 
Reen = u(1)4x/v = 1.0. This implies that the convective and dissipative terms in (10.3) 
аге of the same order. Solutions for К, = 1.0, obtained with the various explicit 
and implicit schemes provided in Sects. 10.1.2 and 10.1.3, are given in Table 10.3. 
All methods give smooth results. The FTCS scheme is the most accurate explicit 
scheme and the Crank-Nicolson four-point upwind scheme with а= 0.5 is the most 
accurate implicit scheme. 

Representative solutions for a viscosity v = 0.03 (К. = 3.33) are shown in Table 
10.4. The Crank-Nicolson finite difference scheme produces a solution that in- 
dicates dispersion-like wiggles particularly behind the shock. Both the Crank- 
Nicolson finite element scheme and the Crank-Nicolson four-point upwind scheme 
produce smooth, accurate solutions for this case. The shock is spread over about 
nine grid points so that the computational algorithms can resolve the sharp 
gradient in u without difficulty. 

If v 0.001 (К. = 100) the shock is much sharper (Table 10.5 and Fig. 10.8) 
and is likely to cause dispersion-related oscillations. 

In principle it is possible to construct low-dispersion schemes following the 
procedure in Sect. 9.4 of generating the modified equation (Sect. 9.2.2) and choosing 
q or ó in (10.20) to cancel the coefficient of the lowest-order odd derivative. The 
nonlinear nature of (10.3) causes many additional products to be generated which 
do not fit into the dispersive/dissipative pattern of Sect. 9.22. If an equivalent linear 
scheme is constructed by locally freezing the nonlinear coefficients, the following 
optimum values for ó and q are obtained: 

1 (u,4t/ dx)? 


дор, = gt 12 ^" (10.26) 


(и, At/ Ах)? 


4 (10.27) 


Фр = 0.5 + 
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Fig. 10.8. Velocity distribution at 
t= 2.00 with Reet = 100 





Clearly these values are a natural extension of the optimal values derived in 
Sect. 9.4.3 for the convecting temperature front. However, the use of (10.26) or 
(10.27) are less successful than in Sect. 9.4.3, mainly due to the additional nonlinear 
terms not accounted for. 

Consequently an alternative strategy has been adopted here. That is to choose 6 
and g empirically and to introduce sufficient artificial dissipation to eliminate most 
of the ripples without diffusing the shock front over too many grid points. The 
results shown in Table 10.5 and Fig. 10.8 indicate that this is a successful strategy 
and that even the Crank-Nicolson finite difference scheme is producing quite 
accurate solutions with judicious use of artificial dissipation. 

However, this example is well suited to the use of artificial dissipation, since the 
exact solution is constant except at the shock. Consequently a large value of v, in 
(10.25) has no influence on the solution except in the immediate vicinity of the 
shock. A better technique is to make v, a function of the solution so that it only 
assumes a significant value at the shock. This technique is discussed in Sect. 14.2.3. 

The one-dimensional Burgers’ equation with a shock-like (i.e. thin) internal 
layer can be computed with spectral methods (Sect. 5.6). A direct application of the 
Galerkin spectral method produces a solution that demonstrates global oscil- 
lations for а nine-term Legendre polynomial solution at v —0.01 (Fletcher 1984, 
pp. 194—198). Basdevant et al. (1986) obtain solutions with v —0.01/z for spectral 
tau and collocation (pseudospectral) methods (Sect. 5.6.3) based on Chebyshev 
polynomials. Without co-ordinate transformation it is found that 256 collocation 
points must be used to avoid oscillatory solutions. It is concluded that spectral 
methods, being global, are not as well suited to the calculation of thin internal 
layers as are local methods, such as finite difference and finite element methods. 


10.1.5 Nonuniform Grid 


When severe gradients occur either internally or adjacent to a boundary, more 
accurate solutions can be obtained by grid refinement. However, for a uniform grid 
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throughout the computational domain, grid refinement can be computationally 
expensive, particularly in multidimensions. 

An obvious alternative is to introduce a grid which is refined only in the parts of 
the computational domain where severe gradients are to be expected. The mech- 
anics of generating such a grid are discussed in Chap. 13. Here the use of non- 
uniform grids for one-dimensional steady problems will be examined, primarily to 
contrast the expected improvement in the solution accuracy through local grid 
refinement with the possible reduction in the order of the truncation error. 

The convection-diffusion equation (Sect. 9.3) 

dT «Т 

dx а--:-0 ; (10.28) 
with и and х constant and boundary conditions 7(0)=0, T(1)= 1, produces a severe 
boundary gradient adjacent to x —1 (Fig. 9.5) for large values of u/c. 

Local grid refinement adjacent to х-- 1 can be achieved by allowing the grid to 
grow geometrically. That is, 


x 1-Ху-”(х;- Xj- іл", Ах; А (10.29) 


ј+ 
where ғ, is the grid growth factor. For the particular example shown in Fig. 9.5 а 
value of ғ, less than one would be appropriate to place more grid points close to 
x=1.0. Typically ғ, is constant through all or a large part of the computational 
domain. 

A general three-point representation of dT/dx on a nonuniform grid that 
produces the smallest truncation error can be obtained using the technique 
described in Sect. 3.2.2. A Taylor series expansion of such a three-point discretis- 
ation about x; is 


(Tj41— Ty/r,-r4(T;— T; 1) 


(1--ғ,)4х; 

dT Ax? dT (Ах) dT 
=| — —1)— +... 10.30 
EE 6 hast (Ser NUN Dust |. ( ) 


where r, is defined by (10.29) and 4x,— x,—x,_,. This representation is of second 
order but introduces an additional third-order error that is not present for a 
uniform grid (ғ,-- 1.0). 

Application of the Galerkin finite element method to (10.28) with linear 
interpolation (Sect. 5.3.1) produces a “three-point” representation of dT/dx. А 
Taylor series expansion of the finite-element three-point representation of dT/dx is 


Т;1-Т;- [72 Ax; nit Ax? 5+1 PT | 
4 


e OLET 
(ЕТУІ) Nir (10.31) 


dx Dat г. +1 dx? 


This representation introduces a first-order dissipative term whose sign depends on 
whether the grid is geometrically growing or diminishing in the positive x direction. 
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Clearly if (r, — 1) or Ax; are not small the influence of the term for large и/х may be 
significant compared with «d?T/dx? in (10.28). 

А three-point representation of d?7/dx? іп (10.28) that minimises the truncation 
error has the following Taylor series expansion about x;: 


1,1-1; __ 2 
| ry (T; r-o (тула) 


dT Ах; aT Ax? r3+1 d*T 
-| Sa es Dicti» TI dxt | . (10.32) 
The left-hand side of (10.32) also results from a linear Galerkin finite element 
discretisation of d?T/dx?. It is apparent that the use of a nonuniform grid leads to 
the appearance of a first-order truncation error. 

Substitution of the left-hand sides of (10.30 and 32) into (10.28) produces the 
tridiagonal scheme: 





—1 1 
—(0.5К ur, 1)Т)-. 1 Еле + ( +=) hr 
0.5К 1 


The substitution of the left-hand sides of (10.31 and 32) produces the slightly 
simpler tridiagonal scheme 


1 1 
—(0.5Reey + 1)T}~1 + ( 1 «ye (оз). =0, (10.34) 


where К..ц-иАх;/о. Solutions based on (10.33 and 34) have been obtained with 
u/x = 20 and 11 points in the interval 0 € x < 1 and are presented in Tables 10.6 and 
10.7. For а uniform grid, г„= 1, the cell Reynolds number R,,, —2; this case is 
plotted in Fig. 9.5. 

For the uniform-grid case the rms error is dominated by the local errors at the 
grid points close to x — 1. Reducing r, is seen to reduce the rms error (Table 10.6) 
with solutions of (10.34) being considerably more accurate than those of (10.33). 
Since r, « 1.0 the use of (10.34) introduces positive dissipation. When using (10.33) 
the choice r, — 0.80 produces the minimum rms solution error. This corresponds to 


Table 10.6. Variation of the rms error with r, for sol- 
utions of (10.28) at u/a=20 


ғ 070 080 090 10 
rms error 


(10.33) 0.0338 00276 0.0319 0.0455 
(10.34) 0.0048 0.0080 00197 0.0455 
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Table 10.7. Solutions to (10.28) using a nonuniform grid г, —0.8, и/а = 20 


x 0153 0827 0485 0932 0970 1000 error 
T, exact 00072 00312 01010 02584 05480 1000 — 
Т, (10.33) 00006 0011 00615 02050 05013 1.0000 0.0276 
Т.(103) 00035 00229 00879 02439 05376 10000 00080 


a relatively uniform error distribution (Table 10.7). For growth factors ғ, < 0.80, 
although the local solution error is smaller close to x — 1.0, it becomes larger at 
smaller values of x. For more complicated governing equations in muitidimensions, 
grid growth factors in the range 0.8 Sr, < 1.2 are usually appropriate. 

The use of a nonuniform grid with a severe internal gradient can be illustrated 
using a modified form of Burgers' equation 

ди _ ди д?й 

a -05- "3379 . (10.35) 

Compared with (10.1) an additional term, - 0.504/дх, has been introduced. For the 
initial and boundary conditions given by (10.22 and 10.23), the inclusion of the 
additional term is equivalent to allowing the grid to migrate with the shock so that 
the shock does not move relative to the grid. Аз a result, (10.35) has a particularly 
simple steady-state solution 


à, —0. 5] 1-шш (| | (10.36) 


Decreasing у sharpens the gradient centred at х = 0. Equation (10.35) сап be written 
in conservation form as 
ді OF дш 
ôt Ox dx? 





-0, (10.37) 


where F = 0.5(i? — à). Since only steady-state solutions of (10.37) are required it is 
appropriate to introduce the fully implicit linearised two-level discretisation, 


stl 


at 





dF 
= (ut дит) Г (n x sient) (10.38) 


where L,, and L, are given by the left-hand sides of (10.32), (10.30 and 31) and 
dF/du — u —0.5. 

The application of (10.31) to дЕ/дх is an example of the group finite element 
formulation (Sect. 10.3), since F is a nonlinear group. The following tridiagonal 
system of equations for Аи"! is obtained: 


{1+ At[L(u —0.5) vLyx]} Au] = А (УЛ ГЕ"). (10.39) 
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Equation (10.39) is integrated until 4u5*! is negligibly small corresponding to a 
solution of (vL,,u;—L,F;)=0. 

For a shock centred at х=0 it is convenient to introduce a grid growing 
geometrically in both the positive and negative x directions from x=0. Solutions 
based on (10.30, 31) for v= 0.08 with 11 points in the interval, — хл, € x € хау, are 
shown in Tables 10.8 and 10.9. For r,=1.0 the solutions correspond to R,..=5 
based on (1) = 1.0. 


Table 10.8. Variation of the rms errors with r, for the 
solution of (10.39) with v = 0.08 


т, 1.00 110 1.20 1.30 
rms error 


(10.30) 0.0084 0.0093 0.0123 0.0161 
(10.31) 0.0084 0.0090 00082 0.0068 


Table 10.9. Solutions of (10.39) using a nonuniform grid, г, = 1.3, with v= 0.08 


rms 
x —1.368 —0.882 -0509 —0.221 0000 0221 0509 0.882 1.368 error 
и, exact 1.000 0.996 0,960 0.799 0.500 0201 0040 0.004 0000 — 


и, (10.30) 0.999 1.001 0.993 0.807 0500 0193 0007 -0001 0.001 0.0161 
и, (10.31) 0999 1.002 0.967 0.788 0500 0212 0.033 -0002 0.001 0.0068 


The errors shown in Table 10.8 indicate that the use of the discretisation for 
OF /dx given by the left-hand side of (10.30) contributes to a reduction in accuracy 
with increasing r,. In contrast, the use of (10.31) does cause a reduction in rms error 
for ғ, sufficiently large. However, the improvement in accuracy is not very great. 
For к„= 1.3 the two solutions are compared with the exact solution in Table 10.9. 
Both solutions are exact at x=0.0 and the major contributions to the rms error 
comes from the four points, x= +0.221, +0.509. 

The application of a nonuniform grid to “real” flow problems usually requires 
considerable numerical experimentation to arrive at the best choice for the grid 
growth factor, ғ,, and the discretisation scheme. For unsteady problems with 
locally severe gradients it may be necessary to make the grid adapt to the local 
solution. That is, the region of local refinement may change with time. Such 
adaptive grid techniques (Thompson et al. 1985; Kim and Thompson 1990) are 
beyond the scope of this book. 

Nonuniform grids may also be required where the computational boundary 
does not coincide with the local grid. Noye (1983) provides a summary of 
traditional techniques for handling this situation. A more modern approach to 
irregular boundaries is through the use of generalised curvilinear coordinates 
(Chap. 12), so that the computational boundary automatically coincides with a grid 
line. 
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10.2 Systems of Equations 


The material] presented in Sect. 10.1 and previous chapters is related to the 
occurrence of a single governing equation. However, fluid dynamic phenomena are 
described, usually, by systems of equations (Chap. 11). An exception is inviscid, 
incompressible flow which is governed by Laplace’s equation (Sect. 11.3). The 
purpose of this section is to consider a system of governing equations and to apply 
to it representative discretisations that have already been applied to a single 
governing equation. 

One-dimensional unsteady compressible inviscid flow is governed by a system 
of three equations: continuity (Sect. 11.2.1), x-momentum (Sect. 11.2.2) and energy 
(Sect. 11.2.4). After a suitable nondimensionalisation (Sect. 14.2.3) the system of 
equations can be written 





да OF 
2+5=0 , where (10.40) 
Q еч р 
245 
-| ^, and F-| “Ту | (10.41) 
„Sgu? 
5—1 eu (rosa) 


In (10.41) ọ is the density, и is the velocity, p is the pressure and y is the specific heats 
ratio. 

Although о, u and p аге » the dependent variables, discretisation is applied 
directly to g and F. This wili be illustrated here for the two-stage Lax-Wendroff 
scheme (10.11, 12). Applied to (10.40) this gives 


9)+112=0.5(9' +g) 054 LE Fr) and (10.42) 


4-4) AU j+127 Fi. 12). (10.43) 


At each stage of the solution development, ‚ и and раге evaluated from 4 (10.41) so 


that the components of F can be dete ed. 
Klopfer and McRae (1983) use the modified equation approach (Sect. 9.2.2) to 


correct the main (dispersion) error for the Lax-Wendroff scheme applied to the 
Euler equations, (10.40) modelling the one-dimensional shock tube problem 
(Problem 14.8). The resulting scheme is more accurate, less oscillatory but typically 
requires a smaller Atma: for stable solutions. 


Implicit schemes can also be constructed for systems of equations. Thus, a 
Crank-Nicolson treatment of (10.40) would be 








qj-— g- -0257- (Е, (ЕЕ! . (10.44) 
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To produce a linear system of algebraic equations for 44/7! the nonlinear terms 
F"*! are expanded as 


ғ-ғы АЛҚ... (10.45) 


where A (--дЕ/д4)15 а 3 x 3 matrix evaluated from (10.41). The specific form of A 
can be deduced from (14.99). Substituting (10.45) into (10.44) and rearranging gives 


At 
02524-14421 +144]'! 0257 4, qf] 


At . 
= — 0.5 „(Е Ё/-1) . (10.46) 


Equation (10.46) is a 3 x 3 block tridiagonal system of ‘equations for 44%” ! that can 
be solved by the block Thomas algorithm (Sect. 6. 25 5). The 3x 3 block size follows 
directly from the number of governing equations at each grid point. Having 
obtained 44%" ! from (10.46) the solution at the nth time step is obtained from 497! 
— =q; Eu Aq’ + 1 

The ste stability of the discretised scheme, e.g. (10.46), can be assessed using the von 
Neumann analysis (Sect. 4.3) as for scalar equations. Since the von Neumann 
analysis is only applicable to linear equations it is convenient to linearise (10.40) 
before discretisation. 

That is 


where 4=02Е/09 as in (10.45). The von Neumann stability analysis proceeds in the 
manner described in Sect. 4.3.4 except that an amplification matrix G is produced 
in place of the amplification factor G in (4.33). For the Crank-Nicolson scheme 
(10.46) the result is 


“н, „ү! .At ,. 
G —| 1+0.51— Asin 0 1—0.51-— 4500]. (10.47) 
Ax ~ Ax 


If А„ are the eigenvalues of G, stability requires that | Am] £ 1.0 for all т. The stability 
will depend on the local magnitude of the eigenvalues of A (14.32). Apart from the 
appearance of the Jacobian matrix A the structure of (10.47) is very similar to that 
for the Crank-Nicolson scheme applied to the linear convection equation 
(Table 9.1). 

The system of equations (10.40) is considered at greater length in Sect. 14.2.3 
(Vol. 2) where a program, SHOCK, is provided Базей on ihe two-stage Lar- 
Wendroff scheme (10.42, 43), and the MacCormack scheme (14.49, 50). A typical 


solution fora pr propagating shock wave is shown in Fig. 10.9, m 25 time steps. This 
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Fig. 10.9. Propagating shock profile, P,/P, = 1.5, у = 1.4, v=2 (14.53) 
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solution is obtained with a uniform grid 4x = 0.01 in the interval 0 € x < 1.0. At 
t = 0 the shock is located at x = 0.5; subsequently it propagates to the right with a 
nondimensional shock speed = 1.195. 

Artificial dissipation has been added to (10.40) to produce a smoother solution. 
The form of the artificial dissipative terms is indicated by (14.53). Broadly it is 
similar to the term added to the one-dimensional Burgers' equation (10.3) to give 
(10.25). However, v, the equivalent of v, in (10.25), is a function of the local solution 
so that it only has a significant value at the shock. 

Comparing the results of Fig. 10.9 with Fig. 10.8 it is clear that the computed 
shock profile for the "real" governing equation is less sharply defined than for the 
one-dimensional Burgers’ equation. This is partly due to the relative complexity of 
the equations and is partly a reflection of the relative accuracy of the Lax-Wendroff 
and Crank-Nicolson schemes. | 





10.3 Group Finite Element Method 


The finite element method has been applied to the Sturm-Liouville problem in 
Sect. 5.4, the one-dimensional diffusion equation in Sect. 5.5.1, the two dimensional 
diffusion equation in Sect. 8.3 and the two-dimensional transport equation in 
Sect. 9.5.2. In all these applications the governing equations are linear. 

However, the fluid dynamic governing equations (Chap. 11) typically contain 
nonlinear convective terms. In primitive variables the convective terms are quad- 
ratically nonlinear for incompressible flow and cubically nonlinear for com- 
pressible flow. 

The conventional finite element method introduces a separate approximate 
solution, like (5.44) for each dependent variable appearing in the governing 
equation. Application of the Galerkin method produces large numbers of products 
of nodal values of dependent variables, particularly from the nonlinear convective 
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terms. This feature contributes to an inherent lack of economy of the finite element 
method in multidimensions (Fletcher 1984, p. 243). 

However, the contribution to the lack of economy from the nonlinear terms can 
be avoided by introducing the group finite element formulation (Fletcher 1979, 
1983b). The group formulation is generally applicable to nonlinear problems but is 
particularly effective in dealing with convective nonlinearities. 

The group formulation requires two steps: 
i) the equations are cast in conservation form, for example (10.3), 
ii) a single approximate solution is introduced for the group of terms appearing in 
the differential terms, for example F in (10.3). 


10.3.1 One-Dimensional Group Formulation 


The group finite element method will be illustrated for the one-dimensional 
Burgers’ equation in conservation form (10.3). Separate approximate solutions, 
equivalent to (5.44), are introduced for й and F in (10.3) as 


и=Уфш and F-5 ФА, (10.48) 
1 1 


where F, represents F evaluated at the ИВ node. Substitution into (10.3) and 
evaluation of the weighted residual equation (5.5) based on the Galerkin weighting 
function produces a system of ordinary differential equations. On a uniform grid 
this system can be written 


d 
М, Е z] tL.F;— VL, uj-0 , (10.49) 
where M,—(1,$,1, L,— {—1, 0, 1}/24х and L,,={1, —2, 1/Ax?, for linear 
interpolating functions, ¢,, in (10.48). With Crank-Nicolson time discretisation, 
(10.49) produces an algorithm that is given by (10.20) with ó—£ and 4=0. 

If the expression L,F, is expressed in terms of и; the result can be written 


Е=0.5(иу— + иу.) t (10.50) 


If the conventional finite element method is applied to (10.1) the discretisation of 
the convective term иди/0х, on a uniform grid, is 


ди — (uj Signed iuc un 


дх 3 2Ax 





(10.51) 


The linear terms ди/ді and 224/2х2 produce the same discretisation as in (10.49). 

The discretised form generated by the conventional finite element method has 
been combined with Crank-Nicolson time differencing to produce the solution of 
the one-dimensional Burgers’ equation shown in Fig. 10.10 as CN-FEM(C). The 
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Fig. 10.10. Comparison of conventional and 
group finite element methods for the one- 
.f0009009000000000000990005 : dimensional Burgers' equation at t 2.0. 
CN-FEM(C)- conventional finite element 
formulation CN-FEM(G)- group finite ele- 
ment formulation 















+СМ-РЕМ (С) 
o CN-FEM(G) 
— EXRCT 


g.5 






8.8 


group finite element formulation is implemented by (10.20) with $ =} and q=0. In 
Fig. 10.10 it is shown as CN-FEM(G). 

The solutions shown in Fig. 10.10 are based on 41 equally-space points in the 
interval -2.0<х<2.0 and correspond to the boundary and initial conditions 
considered in Sect. 10.1.4. The cell Reynolds number, based on u(1), is Roen = 5. The 
conventional finite element solution is indicating a larger oscillation trailing the 
shock than the group finite element solution. In other parts of the computational 
domain the solutions are similar. _ 

Numerical convergence behaviour of the conventional and group finite element 
methods applied to the one-dimensional Burgers’ equation is compared by 
Fletcher (1983c). For linear, quadratic and cubic interpolation both formulations 
are of comparable accuracy. In one spatial dimension both formulations demon- 
strate similar economy. 


10.3.2 Multidimensional Group Formulation 


The group finite element formulation is demonstrated, here, on the two-dimen- 
sional Burgers’ equations (10.57 and 58). The equivalent divergence vector form of 


these equations is 
04 OF дб (84 4 д4 г)- 5—0 2 (10.52) 
4={и, v}, Е-І,ш), G={w,v7}, 


ду (9х2 "и 
5- [ter tn rt | (10.53) 












att ax’ ду 
where 
y , у 


358 10. Nonlinear Convection-Dominated Problems 


As well as the approximate solutions for u and v, additional approximate solutions 
are introduced for the groups u?, uv, v? and the components of S. A typical 
approximate solution (for rectangular elements) is 


4 
uv= У (uv) dy (é n) , (10.54) 
іші 


where Ф.(4, n} are bilinear interpolating functions (5.59). Тһе term (uv), represents 
the evaluation of uv at the Ith node. 

Application of the Galerkin finite element method, as in Sect. 5.3, produces the 
discretised equations 


——. 


p 
м.м) 9) «ЕН5, where и (10.55) 
| jk sent 
RHS= —M,@L,F-M,@L,G | 
Past 
+v(M,@L,.+M,@L,,)g+M,@M,S . 27 (1056) ^ 
ми 


The similarity in structure with (9.85) is noteworthy. This is a feature of the group 
formulation, that is, at the level at which the discretisation takes place, the 
equations are linear, although indeterminate. Substitution for the nodal groups in 
terms of the unknown nodal variables introduces the nonlinearity but also makes 
the system determinate. ~~ т 

The tensor products of the mass operator and difference operator, eg. 
М, ® L, Ё, in (10.56) indicate that the group finite element formulation is producing 
a nine-point discretisation of dF/dx. Strictly it is only a six-point discretisation 
since one coefficient in L, is zero. If the conventional finite element method, is 
applied to (10.57 and 58) the connectivity is much greater. The connectivity is 
defined here as the number of nodal groups produced by discretising the convective 
term. 

An increase in the connectivity will increase the operation count and hence the 
execution time. The connectivity grows with more dimensions, higher-order interp- 
olation or a higher-order nonlinearity in the governing equation. In order to 
estimate the likely impact on the overall execution time, an operation count 
comparison is provided in Table 10.10. 

The Burgers’ equations and compressible Navier-Stokes equations (Sect. 11.6.3) 
possess quadratically and cubically nonlinear convective terms, respectively. The 
connectivity for the conventional finite element method with linear interpolation is 
seen to rise (Table 10.10) with both increased number of dimensions and an 
increase in the order of the nonlinearity. By contrast, the connectivity of the group 
formulation is affected by the increase in the number of dimensions but not by the 
increase in the degree of nonlinearity. 

A large connectivity translates into a large operation count to evaluate the 
Steady-state residual. It is assumed that the systems of ordinary differential 
equations formed by the discretisation are marched in time using split formulations 








103 Group Finite Element Method 359 


Table 10.10. Connectivity and operation counts for the conventional and group finite element formu- 
lations with linear interpolation 


Conventional F.E.M. Group F.E.M. 


Connectivity Residual Connectivity Residual Conv. R.O.C. 
Equation Convective (convective operation (convective operation | —————— —— 


system nonlinearity nonlinearity) count nonlinearity) count Group R.O.C. 
2-D 

Burgers’ quadratic 49 828 9 206 4 

3-D 

Burgers’ quadratic 343 12603 27 1308 9 

2-D visc 

comp N-S cubic 225 6772 9 404 17 

3-D visc 

comp N-S cubic 3375 217065 27 2349 92 


like those developed in Sects. 8.2, 8.3 and 9.5. Typically at each time step the 
evaluation of the steady-state residual, RHS in (10.56), takes about 50% of the 
execution time. Consequently the residual operation count for a single time step 
correlates approximately with the total execution time. 

The results shown in Table 10.10 indicate that the group finite element 
formulation is comparatively more economical as the order of the nonlinearity or 
the number of dimensions is increased. For three dimensional viscous flow the 
group formulation will evaluate the steady state residual almost one hundred times 
more economically than the conventional finite element formulation. This cor- 
responds to approximately a factor of fifty improvement in the overall execution 
time, If higher-order interpolation than linear is used the relative economy of the 
group formulation, compared with the conventional finite element formulation, 
becomes even greater. 

The relative execution times for the conventional and group finite element 
discretisations of the two-dimensional Burgers’ equations predicted in Table 10.10 
are confirmed in practice (Fletcher 1983b). The fourfold improvement shown in 
Table 10.10 corresponds to a measured ratio of the execution times of two and 
a half. 

For the steady two-dimensional Burgers’ equation with v=0.1 the relative 
accuracy of the group and conventional finite element formulations can be assessed 
from the results shown іп Fig. 10.11. These results were obtained on a uniform grid 
with Ax - Ау. Тһе rms errors decrease with grid refinement at about the same rate, 
and for a given grid, are of about the same magnitude. Both schemes are seen to be 
more accurate than a three-point finite difference discretisation of (10.57 and 58). 
The convergence properties are based on a computed solution (Fletcher 1983b) 
which has a more moderate gradient, but predominantly in the x direction, than 
that shown in Fig, 10.12. 
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8 О = 3-FD Fig. 10.11. Spatial convergence prop- 
~ 5 erties for the discretisations of the 
LFE (С) steady two-dimensional Burgers’ 
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The exact solution is given by (10.63 and 64) with 
a, =a,=110, a,=a,=0, а= 1.0, A=5, хо = 1.0 and у-01. 


This solution is tabulated in Fig. 6.14. 

The group formulation has also been investigated by Christie et al. (1981), who 
refer to it as the "product approximation". Christie et al. have demonstrated that 
the group formulation 1$ theoretically as accurate as the conventional finite element 
method for representative model problems. 

The broad conclusion from this subsection is that a group finite element 
formulation must be used in preference to the conventional finite element formu- 
lation to achieve acceptable economy when discretising the nonlinear equations 
governing fluid dynamics (Chap. 11) in more than one dimension. However, the use 
of a group formulation is not expected to reduce the solution accuracy. 


10.4 Two-Dimensional Burgers’ Equation 


Just as the one-dimensional transport equation has a multidimensional counter- 
part (Sect. 9.5) so the one-dimensional Burgers’ equation can be extended to 
multidimensions. The two-dimensional Burgers' equations are 


ди дй дй (24 эк. 


Qt "Ox ду дк ду? (10.57) 


4 4008 0% 925 02% 
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The two-dimensional Burgers’ equations coincide with the two-dimensional 
momentum equations for incompressible laminar flow (Sect. 11.5.1) if the pressure 
terms are neglected. 


10.4.1 Exact Solution 


Like the one-dimensional Burgers’ equation exact solutions can be constructed 
(Fletcher 1983d) using the Cole-Hopf transformation. In two dimensions the 
Cole-Hopf transformation introduces a single function Ф to which й and 6 in 
(10.57, 58) are related as 


-»® -x2 
4----- ф---- . 10.5 
й $ , Uv 2 (10.59) 








As а result, (10.57 and 58) аге transformed to a single equation 


ob (02% оф 
(atta) | (10.60) 


which is the two-dimensional diffusion equation (8.1). For appropriate boundary 
and initial conditions, (10.60) has an exact solution which, through (10.59) can be 
used to provide exact solutions of (10.57, 58). 

The main interest is in the steady form of (10.57, 58). Therefore exact solutions 
of the steady part of (10.60), ie. - 

ô Uo 

— +> = 10.61 

Ax? + ду? 0 ) 
will be sought. The following exact solution of (10.61) provides considerable 
control over the corresponding velocity solutions of (10.57 and 58): 


D=a, +а,х+азу+а,ху+а, [е7 79 e 55-01 cos(Ay) , (10.62) 


where a,—as, A and x, are chosen to give appropriate features to the exact solution. 
Using (10.59) the corresponding exact solutions of (10.57 and 58) are 


—2v(a;-r a, y t Aas [е7 = —e 749779] cos(Ay)) (10.63) 


and 
u _ А(х- хо) — A(x- хо) c} 
j- 2v{a,+a,x—Aas[e te ]sin(4y)] (10.64) 


а, +а.х+азу+аху+аз [е^ 79 +е ^9 79 ]cos(Ay)] | 
2х+аз 


A typical exact solution for a steady ‘flow’ with a severe internal gradient is shown 
in Fig. 10.12. This solution is generated by the parameter choice 
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Fig. 10.12. Exact steady-state 
solution of the two-dimensional 
Burgers’ equations (10.57, 58) 


a, =a,=1.3x10'*, аз=а,=0, аұ-1.0, A=25, хо=1 and у-004. 
(10.65) 


The exact solution (10.63, 64) is particularly useful for assessing the accuracy of 
various algorithms (Fletcher 1983c), as in Fig. 10.11. 


10.4.2 Split Schemes 


The development of split schemes for multidimensional equations like the two- 
dimensional Burgers’ equations follows closely the path indicated in Sects. 8.2, 8.3 
and 9.5. The nonlinearity introduces an additional complication. How this is 
handled will be illustrated using the two-dimensional Burgers equations in conser- 
vation form (10,52). ЕЕ 

Equations (10,55 and 56), which are the discretised form of (10.52), represent 


both a group finite element and a finite difference discretisation if the mass 
operators are given the following interpretation: 


М,=(8,,1—28,,8,), M,=(6,,1-28,,4,)" , (10.66) 





as іп (9.86). The choice 6, =ô, =0 gives a conventional three-point finite difference 
discretisation; the choice ð, =ð, = gives a group Galerkin finite element formu- 
lation based on bilinear interpolation. 

It would be possible to introduce a three-level time discretisation of (10.55) as 


was done to generate (9.87). However, the Interest here I$ in using à pseudo- 


transient formulation (Sect. 6.4) to obtain the steady-state solution. Therefore the 
T ыл... iN ee a ar E a e t m Ra АЗ === 
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following two-level fully implicit (v=0, B= 1) time discretisation of (10.55) will be be 
зарание > — 


бей — 
Y 
44 ntl ` 
M,G M Е =ВН$"*! , | (10.67) 
jk 


which will lead to a split scheme equivalent to the augmented Newton's method 
(Sect. 6.4.1). To develop a linear system of equations for Ag"*' from (10,67) it is 
necessary to linearise the nonlinear terms Ё, б and 5 in RHS"*! (10.56) about the 
nth time level. This gives 


р" р" АЛД" + 2, 
G'*!-G'"- BAg"*!+..., (10.68) 
S*t1=§"4+CAg"t!+..., 


where A, B and C are evalutated from (10.53) as 


OF |2u 0 0G |v и 
=— = 10.69 
4- д4 -[ я ‚в д4 la М (10.69) 


апа 
as (3u? +02) 2uv | 
c= oq (05/0 | 2uv (и? + 352) 


Substitution into (10.67) permits the following approximate factorisation to be 
constructed: m 





ГМ, + AL, 4 — VL,, —0.5 M,C)] 4g} ,— AERHS" , (10.70) 
[M,+ A(L,B— vL, 05 M,C)] Aq?! = 44%, . (10.71) 


The appearance of the Jacobian matrices A, etc., makes (10.70 and 71) block- 


tridiagonal along grid lines in the x and y directions, 8, respectively. Since 4, étc., 


depend on the solution it is necessary to refactorise ‘the block tridiagonal system at 


ee et а e ees 


every iteration n. The block tridiagonal ‘system can be solved efficiently using the 
block Thomas algorithm (Sect. 6.2.5). 

The von Neumann stability analysis is applied to (10.70, 71) by temporarily 
freezing A, etc. The analysis indicates that the split algorithm (10.70, 71) is un- 
conditionally stable. In In practice. using 100. g too large а. time step can introduce a 
nonlinear instability. 

“Тһе split scheme has a truncation error of O(At, Ax?, Ay*). Since only the 
steady-state solution is of interest it is possible to consider modifications to the left- 
hand sides of (10.70 and 71) that permit the steady state to be reached with less 
computational effort. In the steady-state limit RHS"=0, so that modifications to 


the left-hand side have no effect on the steady-state solution, even though they may 
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influence the transient solution. The preferred strategy is to choose left-hand sides 
that can be solved more efficiently at each step of the iteration without increasing 
substantially the number of iterations required to reach the steady state. 

It was noted in Sect. 6.2.5 that m scalar tridiagonal systems can be solved more 
efficiently than one m x m block tridiagonal system of equations. Equations (10.70 
and 71) may be each treated as two sequential scalar tridiagonal systems by 
deleting off-diagonal terms in (10.69). A further economy may be achieved by 
replacing (10.69) with 


10 10 и? +02\/1 0 


This means that the same left-hand side is used for each scalar component of (10.70 
and 71). That is, the left-hand side need only be factorised (BANFAC) once for two 
solutions (BANSOL) with different right-hand sides, AtRHS{ and 4tRHS}. Equa- 
tion (10.72) with (10.70 and 71) is implemented in program TWBURG 
(Fig. 10.13). 





10.4.3 TWBURG: Numerical Solution 


The split algorithm (10.70, 71) is implemented in program TWBURG 
(Fig. 10.13) to obtain steady-state solutions of (10.52) in the computational domain 
—1xx&x1landOxysy,,, Where y,4, —z/64. Dirichlet boundary conditions are 
obtained from the exact solution (10.63, 64). The initial conditions are obtained 
from a linear interpolation of the boundary conditions in the x direction. 

The major parameters used in program TWBURG are described in 
Table 10.11. Three alternative implementations of (10.70, 71) are available. When 
ME = 1 three-point centred finite difference discretisations are introduced for L,, 
L,,, etc., and д,=0,=0. When ME = 2 the four-point upwind discretisation, used 
іп Sects. 9.3.2, 9.4.3 and 9.5.2, is used for both L, and L,. For Lf? the four points 
(j—2, k) to (1+1 К) are used; for 1,9 the four points ( j, k— 2) to (j, К+ 1) are used. 
For a typical exact solution, Fig. 10.15, и is negative adjacent to х= 1. Strictly in 
this region the four points ( j — 1, k)-(j+ 2, К) should be used to evaluate Г, This 
is not done іп TWBURG for coding simplicity. When ME=3 the mass operator 
parameters 6, and д, are chosen empirically to produce a more accurate solution. 

The right-hand side of (10.70) is evaluated in subroutine RHSBU 
(Fig. 10.14) and the exact solution (10.63, 64) is evaluated in subroutine EXBUR 
(Fig. 6.11) with the dimensions of UE, VE, etc., changed to 21 x 21. In addition the 
value of хо (10.65) is set in EXBUR (line 10). Typical output from TWBURG is 
indicated in Fig. 10.15. 

Solutions produced by the three methods corresponding to the parameter 
choice of (10.65) are shown in Tables 10.12 and 10.13 for a 6x6 grid. The 
corresponding errors in the solution as a function of x are plotted in Figs. 10.16 and 
10.17. Even on such a coarse grid all methods produce comparable accuracy with 
the largest solution errors coinciding with the location of the severe internal 
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gradient (Fig. 10.12). From the results of Sects. 9.4 and 10.1.4 a greater disparity 
between the methods might be expected if the gradient were more severe. However, 
these solutions do indicate that the splitting algorithms can be extended to 
nonlinear equations without difficulty and that potentially more accurate schemes, 
AF-4PU and AF-MO, can be implemented reasonably economically. 


1 

ac 

3c TWBURG APPLIES APPROXIMATION FACTORISATION TO SOLVE 
4c TEE STEADY 2-0 BURGERS’ EQUATIOWS FOR U(X,Y) AND V(X,Y) 
5С 

6 DIMENSION RUC21,21) ,RV(21,21) ,UC21,21) ,V( 21,21) ,UE(21,21), 
7 1VE(21,21),B(5,65) , RRU(65) , DDU(65) ,RRV(65) ,DDV(65) ,CX(3) ,CY (3), 
8 2CXQ(4) ,CYQ(4) , EMX(3) , EMY (3) ,DMX(3,3) ,DEY (3,3), A(5) 
9 COMMON DX,DY,RE,NX,NY,CX,CXQ, CY, CYQ, DX, DMY ,EMX, ЕМУ, RU, RV ,U,V 
10 QPEN(1,FILE-'TWBURG.DAT') 

11 OPEN(6,FILE-'TWBURG.QUT') 

12 READ(1,1)NX,NY,ME,ITMAX,IPR,BET,QX,QY 

13 READ(1,2)DTIM,EPS,RE,EM1,EM2 

14 READ(1,3)(A(J) ,J=1,5) AL 

15 1 FORMAT(SIS,3F5,2) 

16 2 ЕОАМАТ(5Е10.3) 

17 3 FORMAT(2E10.3,4F5.2) 

18 С \ 
19 NIP = NX - 1 
20 NIPP = NXP = 1 
21 ШҮР = NY = 1 
22 WYPP = МУР ~ 1 
23 AN = NXPP*NYPP 
24 C 
25 CALL EXBUR(UE,VE, A, AL) 
26 С 

27 CX(1) = -0,5/DX 

28 CX(2) = 0. 

29 CX(3) = 0.5/DX 

30 CY(1) = -0.5/DY 
31 CY(2) - 0. 
32 CY(3) = 0.5/DY 
33 CXQ(1) = QX/DX/3. 
34 CXQq(2) = -3.«CXQC1) 
35 CXq(3) = -сХ0(2) 
36 СХ0(4) = -схас1) 
37 CYQ(1) = QY/DY/3. 
38 CYQ(2) = -3.*CYQ(1) 
39 CYQ(3) = -CYQ(2) 
40 сүй(4) = -СҮй(1) 
41 CCX =  1./DX/DX/RE 
42 CCY = 1,/DY/DY/RE 
43 IF(ME .ИЕ. 2)QX = 0. 

44 IF(ME .ИЕ. 2)QY - 9. 
45 ЕМХ(1) = 0. 
46 ЕМҮ(1) = 0. 

47 IF(ME .Ей. 3)EMX(1) = ЕМІ 
48 IF(ME .EQ. 3) EMY(1) = EM2 | 
49 EMX(2) = 1. - 2, *EMX(1) 

50 EMX(3) = EMX(1) 

51 ЕМУ (2) = 1. = 2.*EMY(1) 

52 EMY(3) = EMY(1) 

§3 DO 4 I = 1,3 
54 DMX(I,1) = CCX*EMY(I) 
55 DMX(I,2) = -2,*DMX(I,1) 

56 DMX(I,3) = DMX(I,1) 

57 DMY(1,I) = ССҮ*ЕМХ(Т) 

58 DMY(2,I) = -2.*DMY(1,I) 


59 4 DMY(3,I) - DMY(1,I) Fig. 10.13. Listing of program TWBURG 
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60 IF(ME .EQ. 1) WRITE(6,5)WX,NY,ME, ITMAX 

61 IF(ME .EQ. 2)WRITE(6,6)NX,NY, ME, ITMAX 

62 IF(ME .EQ. 3)WRITE(6,7) WX, ИУ, МЕ, ITMAX 

63 5 FORMAT(’ 2-D BURGERS EQUATION: AF-FDM, Их,ИҮ =',213,' ME =’, 
64 112,' ITMAX-',I3) 

65 6 FORMAT(" 2-D BURGERS EQUATION: AF-4PU, ИХ,ШҮ =',213,' ME -', 
66 112,' ITMAX-',I3) 

67 7 FORMAT(^ 2-D BURGERS EQUATION: AF-MO,  NX,NY -',2I3,' ME =’, 
68 112," ITMAX-',I3) 

69 WRITE(6,8)BET, DTIM, RE, QX, СУ, EMX(1) , EMY (1) 

79 8 FORMAT(’ BETA-',F5.2,' DTIMs',F5.3,' RE-',F5.1,' QX-',F4.2 
71 1,’ QYs',F4.2,' EMX-',F5.3,? EMY-',F5.3) 

72 WRITE(6,92(4(J),J71,5) , AL, DX, DY 

73 9 FORMAT(' A= ›,2Е10.3,3Е5.2,’ AL=’,F5.2,’ DX,DY-',2F8.5,/) 
74 С 

75 С GENERATE INITIAL SOLUTION 

76 С 

77 DO 10 J = 1,NX 

78 V(1,J) = VEC1,J) 

79 VCNY,J) = VECNY,J) 

80 (1,2) = ШЕ(1,Ј) 

81 10 U(OY,J) = ЕО, 

82 DO 12 К = 2,NYP 

83 U(K,1) = UECK, 1) 

84 U(K,NX) = UECK, ЫХ) 

85 V(K,1) = VE(K,1) 

86 V(K,NX) = VECK, ИХ) 

87 DO 11 J = 2,NXP 

88 ÀJ2J-1 

89 U(K,J) = U(K,1) + 0.5*AJeDX*CU(K, WX) -UCK, 1)) 


90 11 V(K,J) = V(K,1) + 0.5*AJ«DX*CV(K,NX)2-VCK, 10) 
91 12 CONTINUE 

92 IF(IPR .LE. 1)GQTO 17 

93 DO 13 К = 1,NY 

94 13 WRITE(6,14) (UE(K, J) , J21, NX) 

95 14 FORMÁT(4E UE-,7F10.4) 

96 DO 15 К = 1,NY 

97 15 WRITE(6,18)(VE(K,J), J=1,NX) 

98 16 FORMAT(4E VE-,7F10.4) 


99 17 DO 20 J = 1,NX 
100 DO 18 К = 1,8Ү 
101 RU(K,J) = 0. 
102 18 RV(K,J) = 0. 
103 DO 19 К = 1,5 


104 19 B(K,J) = 0. 
105 20 CONTINUE 


106 ITER = 9 
107 C 

108 21 DBT = BET*DTIM 

108 СХА = 0.5*DBT/DX . 

110 СТА = 0.5*DBT/DY 

111 ССХА = DBT«CCX 

112 CCYA = DBT*CCY 

113 С 

114 CALL RHSBU(RMSU,RMSY МЕ, ОТІМ) 

115 C 

116 IF(RMSU .LT. EPS)GOTO 34 

117 IF(RMSU .GT. 1.0E+04)GOTO 34 . 
118 C 

119 C TRIDIAGONAL SYSTEMS IN THE X-DIRECTION 

120 C 
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e 


с 


22 
23 


24 


25 


26 
27 
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DO 27 К = 2,NYP 
DO 23 J = 2,XXP 
JM -J-1 
ІР-1%1 


B(2,JM) = EMX(1) - CCXÀ - CXA®U(K,JM) ~ O.5«EMX(1) *DBT*REs 
1(U(K, JM)*U(K,JM) + V(K,JM)*V(K,JM)) 

B(3,JM) = EMX(2) + 2.%ССХА - 0.5«EMX(2) *DBT*REs(U(K, J) eU(K, J) 
1 + УСК, DV, 1) 

B(4,JM) = EMX(3) - CCXA + СХА«О(К,ІР) ~ 0.5*ЕМХ (3) *DBT*RE* 
1(U(K,JP)*«U(K,JP) + V(K,JP)*V(K, JP2) 

RRUC JM) = RUCK, J) 

RRV(JM) = RV(K,J) 

IF(ME .ИЕ. 2)СОТО 23 


LST - 1 
IF(J .Ей. 2)15Т = 2 
B(1,JM) - 0. 

DO 22 L=LST,4 


tJ = J+L- 3 


B(L,JM) = B(L,JM) + CXQ(L)eU(K,LJ) «DBT 
CONTINUE 

B(2,1) = 0.. 

B(4,JM) = 0. 


ТЕ(МЕ .ШЕ. 2)GOTO 25 
REDUCE TO TRIDIAGONAL FORM 
DO 24 JM - 3,NXPP 


JMM - JM - 1 
DUM = B(1,JM)/B(2,JNM) 


B(2,JM) = B(2,JM) - B(3,JMM)*DUM 
B(3,JM) = B(3,JM) - B(4,JMM) «DUM 
B(1,JM) = 9. 

RRUCJM) = RRU(JM) ~ RRUC IMM) «DUM 
RRV(JM) = RRV(JM) - RRV(JMM)*DUM 
B(1,2) < 9. 

CALL ВАИРАС (В, ИХРР, 1) 


CALL BANSOL(RRU,DDU,B, ИХРР, 1) 
CALL BANSOL(RRY ,DDV, B, IXPP , 1) 


ро 26 J = 2,NXP 
ІМ-1-і 
RU(K,J) = DDU(JM) 
RV(K,J) = DDV(JM) 
CONTINUE 


TRIDIAGONAL SYSTEMS IN THE Y-DIRECTION 


DO 33 J = 2,NXP 
DO 29 К = 2,ШҮР 
KM = K ~ 1 
KP = K +1 


B(2,KM) = EMY(1) - CCYA - CYA*V(KM,J) - O.5*EMYC1)*DBT*RE* 
1(U(KM,J)*U(KM, J) + У(КМ, 7) *VCKM, J)) 

B(3,KM) = EMY(2) + 2.*CCYA ~ O.5«EMY(2) «DBTe*REe(U(K, J)sU(K, J) 
1 + V(K,J)*¥(K,J)) 

B(4,KM) = EMY(3) - ССҮА + CYA*V(KP,J) - O.S5*EMY(3)*DBT*RE* 
1(U(KP,J)*UCKP,J) + V(KP,J) *V(KP, J)) 

RRUCKM) = RU(K,J) 

RRV(KM) - RV(K,J) 

IF(ME „ШЕ. 2)GOTO 29 


Fig. 10.13. (cont) Listing of program TWBURG 
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182 LST = 1 

183 IF(K .EQ. 2)15Т = 2 

184 B(1,KM) = 0. 

185 DO 28 L=LST,4 

186 LK = K+L- 3 

187 28 BCL, KM) = B(L,KM) + CYQ(L)*V(LK,J)*DBT 
188 29 CONTINUE 

189 B(2,1) = 0, 

190 B(4,KM) = 0. 

191 IF(ME „ШЕ. 2)GOTO 31 

192 С 

193 С REDUCE TO TRIDIAGONAL FORM 

194 C 

195 DO 30 KM - 3,NYPP 

196 КИМ = KM - 1 

197 DUM - B(1,KM)/B(2,KMM) 

198 B(2,KM) - B(2,KM) - B(3,KMM)«DUM 
199 B(3,KM) - B(3,KM) - B(4,KMM)«DUM 
200 B(1,KM) = 9. 

201 RRUCKM) = RRUCKM) - RRU(KMM)*DUM 
202 30 RRV(KM) = RRV(KM) - RRV(KMM)&DUM 
203 B(1,2) = 9. 

204 С 

208 31 CALL BANFAC(B,NYPP,1) 

206 CALL BANSOL(RRU ,DDU,B,NYPP,1) 
207 CALL BANSOL(RRV, DDV,B,NYPP,1) 
208 C 

209 C INCREMENT U AND V 

210 C 

211 DO 32 K - 2,NYP 

212 KM = K- 1 

213 UCK,J) = U(K,J) + DDUCKM) 

214 32 V(K,J) = V(K,J) + DDV(KK) 

215 33 CONTINUE 

216 ITER = ITER + 1 

217 IF(ITER .LE. ITMAX)GOTO 21 

218 С 

219 С COMPARE WITH EXACT SOLUTION 

220 С 

221 34 SUMU = 9. 

222 SUMV = 0. 

223 DO 36 J = 2,NXP 

224 DO 35 K = 2,NYP 

225 ру = U(K,J) - МЕСК,1) 

226 DV = V(K,J) - VE(K, 2) 

227 SUMU = SUMU + DU*DU 

228 35 SUMV = SUMV + Буеру 

229 36 CONTINUE 

230 RMSUE - SQRT(SUMU/AN) 

231 RMSVE = SQRT(SUMV/AN) . 
232 WRITE(6,37)ITER,RMSU,RMSV,RMSUE, RMSVE 
233 37 FORMAT(’ AFTER',I3,' ITERATIONS, RMS-RESz',2E10.3, 
234 1% RHS-ERR-',2E10.3) 

235 DO 38 K - 1,NY 

236 38 WRITE(6,39) (U(K,J) ,J21, NX) 

237 39 FORMAT(’ U=',7F10.4) 

238 DO 40 К = 1,NY 

239 40 WRITE(6.41)(V(K,J),Jz1.NX) 

240 41 FORMAT(’? У-?,7Ғ10.4) 

241 STOP 

242 END 


Fig. 10,13, (cont.) Listing of program TWBURG 
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Table 10.11. Parameters used in program TWBURG 
i до 





Рагатаег Description 
ME =1, approx. fact., three-point f.d. method, AF-FDM 
=2, approx. fact., four-point upwind scheme, AF-4PU 
=3, approx. fact., mass operator scheme, AF-MO 
NX, NY number of points in the x and y directions 
ITMAX maximum number of iterations 
IPR > 1, print the exact solution и and v. 
BET weights the nth and (n+ 1)-th time levels, = 1 in Tables 10.12 and 10.13 
ОХ, QY 4x, 4, in the four-point upwind discretisation 1/4, 149 
DTIM, DX, DY At, Ах, Ау 
EPS tolerance on | RHS' ||, п. (10.70) for steady state convergence 
RE Reynolds number, Ке = 1/у 
ЕМІ, EM2 9,, д, in (10.66) 
А а, to ag іп (10.65) 
СХ, СҮ L,, L, in (10.70, 71) 
СХО, CYQ increment to L,, L, to produce LẸ’, 1/9 
EMX, EMY M,, M, in (10.70, 71) 
DMX, DMY М, ® Lyx» M, & L, in (10.56) 
U, V dependent variables 
UE, VE exact solution, (й, 5), given by (10.63 and 64) 
DDU, DDV components of 44%, and 4471! on return from BANSOL 
RU, RV components of RHS" іп (10.70); temporary storage for 441, 
RRU, RRV one-dimensional components of RU, RV required Бу BANSOL 
B quadridiagonal and tridiagonal matrix; left-hand side of (10.70, 71) 
RMSU, RMSV components of ||RHS"|| ns 


RMSUE, RMSVE 


Table 10.12. Variation of u with x for y/y,,,7 0.4, NX 26, NY 26 


x —1,00 


Exact 
AF-FDM 


AF-4PU, 
а= 1.0, q,=0.0 


AF-MO 
5, =5, = 0.26 


rms 
—060 -020 020 0.60 1,00 error 
1.0000 | 10000 0.5375 —0.0333 — 0.0250 — 0.0200 — 
10000 1.0022 0.5221 -00322 --0.0250 - 0.0200 0,0066 
1.0000 1,0003 0.5247 —0,0326 — 0.0256 — 0.0200 0,0054 
1.0000 1.0004 0.5257 —0.0365 — 0.0233 - 0.0200 0,0045 





60 
61 
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SUBROUTINE RHSBU(RMSU,RMSV,ME,DTIM) 
EVALUATES RIGHT-HAND SIDE OF TEE 2-0 BURGERS’ EQUATION 


DOUBLE PRECISION RUD,RVD,SUMU,SUNV,AN,DSQRT 
DIMENSION RU(21,21),RV(21,21),U(21,21) ,V(21,21), 
1DMX(3,3),DMY (3,3) ,EMX (3) ,ENY(3) , CX (3) , CY(3) ,CXQ(4) ,СҮ0(4) 
2,ғ(2),6(2) ,ES(2) 

COMMON DX,DY,XE,NX, NY, CX, CXQ, CY , CYQ, DMX ‚ОМУ , EMX ‚ЕМУ , RU, RY ,U, V 
WIP = MX - 1 


WYP = WY - 1 

SUMY = 0. 

SUMV = 0. 

DO 7 J =2,8ХР 

Do 6 К = 2, МР 

RUD = 0. 

RVD = 0. 

ро 2 И = 1,3 

МК =К-2+и 

DO 1 M - 1,3 
MJ2J-2-*4M 

F(1) = U(WK,MJ)*UCNK,MJ) 
F(2) = ЦСК, МЈ) УСИК, ИЈ) 
G(2) = VCNK,MJ)*VCNK HJ) 
6(1) = F(2) 


ЕЗ (1) = 0.S*RE*UCNK,MJ)*(F(1) + 6(2)) 
Е5(2) = 0.S*RE*V(NK ,MJ)*(F(1) + 6(2)) 


DUM = СХ(М)»ЕМҮ(И)*Е(1) + СҮ(И)*ЕМХ(М)*6С(1) 

RUD = RUD*(DMXCN,M)*DMY(WN,M))*UCNK, MJ) +ЕМХ (М )*ЕМУ(И)*+ЕЗ (1) - DUM 
DUM = CX(M)*EMY(N)*F(2) + CYODSEMX(D *6(2) 

RVD = RVD+(DMX(N,M)+DMY (NM) )*У (ИК , MJ) +ЕМХ (M) sEMY(N) *ES(2) - DUM 
CONTINUE 

IF(ME „МЕ, 2)60ТО 5 

MST = 1 


IF(J .EQ. 2)MST = 2 

ро 3 M = MST,4 

MJ =J-3 +48 

RUD = RUD - CXQ(M)«UCK MJ) *U(K,MJ) 

RVD = RVD - CXQ(M)*U(K, MJ)*V(K,MJ) 

IF(J .Eg. 2)RUD = RUD - CXQ(1)9U(K, 1) *U(K, 1) 
IF(J .EQ. 2)RVD = RVD - CXQ(1)*U(K, 1) *V(K,1) 
ИЗТ = 1 

IF(K .EQ. 2)XST 
ро 4 И = NST,4 
BK=K-3+5 
RUD = RUD ~ CYQ(NRD*U IK, J2*VCIK, 7) 

RVD = RVD ~ CYQC(ID*SVOIK, J) *V(IK , J) 

IF(K .EQ. 2)RUD = RUD - CYQ(1)*U(1,J)*V(1,J) 
IF(K .EQ. 2)RVD = RVD - CYQ(1)*V(1,2)*V(1,2) 
SUMU - SUMU * RUD*RUD 

SUMV = SUMV +  RVD*«RVD 


"ow 


2 


RU(K,J) = RUDeDTIM 
RV(K,J) = RVDeDTIM 
CONTINUE 


AN = (ИХР-1)«(ҰҮР-1) 
RMSU = DSQRT(SUMU/AN) 
ВИЗУ = DSQRT(SUMV/AB) 
RETURN 

END 


Fig. 10.14. Listing of subroutine RHSBU 
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2-D BURGERS EQUATION: АР-МО, NX,NY = 6 6 HE = 3 ITMAXs 20 
ВЕТАз 1.00 DTIMN= .010 RE» 50.0 QX« .00 QY- .00 EMX= .260 EMY= ‚260 


.1008%14 .100Е+14 .00 .00 1.00 AL=25.00 DX,DY= .40000 .00419 


-.0250 -.0200 
-.0250 -.0200 
-.0250 -.0200 
-.0250 -.0200 
-.0250 -.0200 
-.0250 -.0200 
.0000 .0000 
.0000 ,0000 
+0000 .0000 
.0000 .0000 
.0000 .0000 
.0000 .0000 


AFTER 21 ITERATIONS, RMS-RHS= .624E-04 .801E-05 RMS-ERR= .449E-02 .120E-02 


Ат 
UE= 1.0000 1.0000 „5505 -.0333 
Uf» 1.0000 1.0000 .5491 -,0333 | 
UE= 1.0000 1.0000 .5448 =,0333 
UEa 1.0000 1.0000 .5375 -.0333 
UE= 1.0000 1.0000 .5271 -.0333 
UE= 1.0000 1.0000 .5132 2,0333 
VE= .0000 „0000 ‚0000 .0000 
ҮБл .1051 „1051 ‚0600 .0000 
VES .2126 „2126 +1204 .0000 
VE= «3249 3249 «1818 .0000 
YÉ2 4452 .4452 .2447 .0000 
Vis 5774 „5773 „3097 .0000 
Ua 1.0000 1.0000 ‚5505 -.0333 
Us 1.0000 1.0002 .5426 -.0354 
Us 1.0000 1.0003 .5349 -.0364 
Ua 1.0000 1.0004 .5275 -.0365 
Ул 1.0000 1.0003 .5203 -.0354 
Ua 1.0000 1.0000 .5132 -.0333 
Үл .0000 .0000 .0000 .0000 
Үл „1051 „1051 „0586 -.0004 
Ys 2126 „2126 1180 -,0008 
үл .3249 ‚3250 .1790 -,0009 
үл .4452 .4453 .2425 -.0007 
у= .5774 .5773 „3097 ‚0000 


Fig. 10.15. Typical output from program TWBURG 
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Fig. 10.16. Error distribution for the u solution 
of the two-dimensional Burgers’ equation at 


У/Ушах = 0.4 
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Fig. 10.17. Error distribution for the v solution 
of the two dimensional Burgers' equation at 


У/Утах = 0.4 
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Table 10.13. Variation of v with x for y/y,,,.,=0.4, МХ--6, NY=6 


x -100 4-060 -020 0.20 0.60 1.00 Error 
Exact 0.3249 0.3249 01818 900000 0.0000 0.0000 — 
AF-FDM 0.3249 03251 01774 900003 0.0000 0.0000 0.0018 
AF-4PU 0.3249 03246 0.1781 0.0003 — 0.0002 0.0000 0.0015 
q. = 1.0, 4, =0.0 
AF MO 0.3249 0.3250 0.1790 —0.0009 0.0005 0.0000 0.0012 
х= 5, = 0.26 


10.5 Closure 


Computational techniques, introduced in conjunction with the diffusion equation 
in Chaps. 7 and 8, and the linear convection and transport equations in Chap. 9 
have been extended to a family of nonlinear equations, Burgers’ equations, in one 
and two dimensions. Burgers’ equations contain the same form of convective 
nonlinearity as in many of the fluid dynamic governing equations. 

Burgers’ equations possess readily computable exact solutions for many com- 
binations of initial and boundary conditions, For this reason they are appropriate 
model equations on which to test various computational techniques. This feature 
has been exploited, for the one-dimensional Burgers' equation, in Sects. 10.1.4 and 
10.3.1 and, for the two-dimensional Burgers’ equations, in Sects. 10.3.2 and 10.4.3. 

To take advantage of the Thomas algorithm it is necessary to linearise the 
convective nonlinearity. This is required for both implicit schemes in one dimen- 
sion (Sect. 10.1.3) and for split schemes in multidimensions (Sect. 10.4.2). This 
linearisation extends to systems of equations (Sect. 10.2) without difficulty. V 

The Fourier analysis which was directly applicable to linear equations 
(Sects, 9.2.1 and 9.4.3) is still made use of in the von Neumann stability analysis 
after freezing the nonlinearity. The modified equation approach (Sect. 9.2.2) is 
applicable to nonlinear equations. However, products of higher-order derivatives 
appear in sufficient number and magnitude as to make the identification of 
dissipative and dispersive properties less precise. It is usually helpful to analyse the 
equivalent linear equation as an intermediate step, since this equation often 
demonstrates qualitatively equivalent dissipative and dispersive behaviour. The 
construction of more accurate schemes often requires a certain amount of empiri- 
cism imposed on the guidelines provided by the corresponding linear equations. 

Most of the computational techniques developed for linear equations extend to 
nonlinear equations. However convective nonlinearities are often handled more 
effectively in conserved variables, e.g. F in (10.3). Conserved variables will be used, 
where appropriate, in Chaps. 14-18. — о ооо оо е —— 
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The nonlinear nature of the governing equation makes the application of the 
conventional Galerkin finite element method uneconomical, particularly in multi- 
dimensions and if higher-order interpolation is used. This problem is overcome by 
casting the equations in conservation form and in introducing approximate sol- 
utions for the conserved groups. The group formulation is often more accurate 
particularly where the conservation form of the governing equations is preferred on 
physical grounds. 


10.6 Problems 


1-D Burgers’ Equation (Sect. 10.1) 


10.1 


10.2 


10.3 


10.4 


10.5 


10.6 


Determine the truncation error of the scheme, (10.20), with the nonlinear 
coefficients frozen, for the two cases: (i) а=0, (ii) 6=0. Show that the 
optimum values of 6 and q are given by (10.26) and (10.27), respectively. 
Modify program BURG to implement a Crank-Nicolson finite difference 
discretisation of (10.1) and compare the solution with that of the Crank- 
Nicolson finite difference discretisation of (10.3) for the conditions associated 
with Table 10.5. How does the comparison change when the integration is 
made to a larger time? 

Modify program BURG to impiement (10.26 and 27) and compare 
solutions for the intermediate cell Reynolds number conditions appropriate 
to Table 10.4. 

Program BURG can be used to obtain steady-state solutions of (10.37) 
with F-0.5(z7—i) Implement these changes for the general three-level 
scheme, equivalent to (8.26), and obtain solutions with i) y 20, f — 1, ii) у= 0, 
В=0.5, iii) y 20.5, В=1.0, Compare the number of iterations to reach the 
steady state, for the conditions corresponding to Table 109, but with a 
uniform grid. 

For the convection diffusion equation (10.28) code the nonuniform discret- 
isation (10.30-32) and confirm the results shown in Tables 10.6 and 10.7. 
Develop a four-point upwind discretisation on a nonuniform grid with one 
free parameter (equivalent to q). Assume that r, is constant but not equal to 
unity. Apply this discretisation to the modified Burgers' equation (10.37) 
with the two-level fully implicit marching scheme (10.38). Compare the 
accuracy of the solutions with those shown in Tables 10.8 and 10.9 for 
various values of the free parameter. 


Systems of Equations (Sect. 10.2) 
10.7 Modify program SHOCK (Vol. 2, Fig. 14.17) to implement the explicit four- 


point upwind scheme introduced in Sect. 9.4.3. Obtain solutions corre- 
sponding to those shown in Fig. 10.9 with sufficient artificial dissipation to 
suppress excessive oscillations. It may be necessary to use a small value of At 
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(small Courant number, C) to obtain stable solutions Compare the sharp- 
ness of the shock profile with that shown in Fig. 10.9. 

10.8 Extend the Crank-Nicolson finite difference scheme (10.44) to generate a 
Crank-Nicolson mass operator scheme with variable 6 for the system of 
equations (10.40). Modify SHOCK (Fig. 14.17) to execute this scheme and 
compare the quality of the solution with that shown in Fig. 10.9, for 6 
chosen empirically. 


Group Finite Element Method (Sect. 10.3) 


10.9 Implement the conventional finite element method with Crank-Nicolson 
time differencing in program BURG, for the one-dimensional Burgers’ 
equation, and compare the solution accuracy with that produced by the 
group finite element method and the Crank-Nicolson mass operator scheme 
for the conditions corresponding to Table 10.5. 

10.10 Estimate the connectivity and residual operation count for the conventional 
and group finite element formulations applied to the two-dimensional 
Burgers’ equations for quadratic interpolation. 

10.11 Modify program TWBURG to implement the conventional finite el- 
ement method applied to (10.57, 58) and compare the accuracy and economy 
with that of the group finite element method applied to (10.52) for the 
conditions corresponding to Tables 10.12 and 10.13. How does the compari- 
son change on a finer grid? 


2-D Burgers’ Equation (Sect. 10.4) 


10.12 Write a program, based on subroutine EXBUR, to generate exact 
solutions of the two-dimensional Burgers’ equations that produce more 
severe internal gradients. 

10.13 Obtain solutions using program TWBURG оп 6x6, 11 x 11 and 21 x 21 
grids for the conditions corresponding to Tables 10.12 and 10.13, with 
0<2,, 0,<0.30, 0%4,, а,= 1.0. Decide on optimal choices for 6,, ô, and 
qx, 4, for each grid. 

10.14 Execute program TWBURG with the following choice of parameters: 
а, =a,=0.01101, a, =а, —0, а;--1.0, 4—5, хо = 1.0 and v 20.1. This choice 
produces a moderate boundary gradient adjacent to х=1.0. What are 
optimal choices for 6,, д, and qx, д, on an 11 x 11 grid for this case? 


Appendix 


A.1 Empirical Determination of the Execution Time 
of Basic Operations 


А computer program, COUNT, is provided in Fig, A.1 which allows the execution 
time of various types of operation to be determined approximately. The type of 
operation corresponding to different values of ICT are given in Table A.1. 

For each operation a nested pair of DO loops is executed. The inner loop 
repeats the given operation 10000 times. The outer loop repeats the inner loop N 
times. For N sufficiently large the time from the input of ICT and N (line 15) to the 
output of ICT and N (line 48) can be determined manually. Alternatively a timing 
subroutine, if available, can be used to measure the CPU time between lines 15 
and 48. 


Table А.1. Operations considered in Program COUNT 


Execution time 


ICT Operation (secs) for N = 1000 ` 
1 Empty DO loops 12.3 
2 Replacement 15.8 
3 Addition and replacement 19.1 
4 Subtraction and replacement 19.1 
5 Multiplication and replacement 19.0 
6 Divisien and replacement 34.8 
1 Integer IF statement 15.3 
8 Power and replacement 81.9 
9 Square root and replacement 111 

10 Sin and replacement 112 

11 Exponential and replacement 123 

12 Array addition and replacement 25.1 





The execution times shown in Table A.1 are appropriate to the Supermicro- 
computer (SUN Sparc 811) in Table 4.4. The execution time for ICT =1 is 
subtracted from cases ICT = 2 and 7. For all other cases the execution time for 
ICT = 2 is subtracted. For the fixed point operations (FX) shown in Table 4.4 the 
integer statement (line 6) is activated and the library function evaluations (lines 38, 
40 and 42) are replaced with labelled CONTINUE statements to satisfy the GOTO 
statement (line 21). It is recommended that program COUNT be compiled in an 
unoptimised mode to obtain consistent results. 
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1 б COUNT FACILITATES THE DETERMINATION OF THE EXECUTION 
2c TIME FOR SPECIFIC OPERATIONS, DEPENDING ОШ THE VALUE OF ICT. 
з c N IS THE NUMBER OF OUTER LOOPS. 

4 С 

5 DIMENSION Е(10000),Р(10000),6(10000} 
вс INTEGER A,B,C,E,F,G 

7 В = 3.0 

8 С = 2.0 

9 ІА = 8 

10 ро 21 Ј = 1,10000 

11 F(J) = 2.0 

12 21 6(1) = 1.0 

13 WRITE(*,22) 

14 22 FORMAT(5X,'VALUE FOR ICT AND И›,/) 
15 READ(*,*)ICT,N 

16 C 

17 DO 14 I = 1,N 

18 DO 13 J = 1,10000 

i9 C 
20 GOTO(1,2,3,4,5,6,7,8,9,10, 11,12), ICT 
21 1 CONTINUE 

22 GOTO 13 
23 2А=С 
24 GOTO 13 
25 ЗА=В+С 
26 GOTO 13 
27 4А-В-С 
28 GOTO 13 
29 5 A = Bec 

30 GOTO 13 
31 6 A= B/C 

32 6070 13 

33 7 IF(I .GT. O)GOTO 13 

34 GOTO 13 

35 8 А = Beeya 

36 GOTO 13 

37 9 А = SQRT(B) 

38 GOTO 13 

39 10 A = SIN(B) 
40 GOTO 13 
41 11 А = EXP(B) 
42 GOTO 13 
43 12 E(J) = F(J) + 6(1) 
44 13 CONTINUE 
45 14 CONTINUE 

46 C 

47 VRITE(*,15)ICT,X 
48 15 ҒОВМАТ(55," ІСТ=',17,5Х,' N=',I7) 
49 STOP 

50 END Fig. А.1. Listing of Program COUNT 


А.2 Mass and Difference Operators 


It is indicated іп Sects. 5.5.1 and 5.5.2 that the finite element method can be 
interpreted as providing a term-by-term discretisation if directional mass and 
difference operators are identified explicitly. The origin of these operators within 
the Galerkin finite element formulation, Sects. 5.1 and 5.3, will be indicated here in 
relation to the two-dimensional transport equation (9.81) 


т oT OT гт ёгт 


at a 3x) 3:79 . (А.1) 
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An approximate solution is introduced for T as 
1 
Т-7/Т,.%(ху), (А.2) 
ізі 


where ф.(х, y) are two-dimensional bilinear Lagrange interpolating functions, 
equivalent to (5.60). Equation (A.2) is substituted into (A.1), the Galerkin weighted 
residual integral (5.5 and 10) is evaluated and the result can be written 


M,@M,T+uM, @L,T+0M, 9 І,Т-о,М,ФІ,,Т-а,М,ФІ,Т-0, 
(А.3) 


where ® is the tensor product and T=dT/dt. The terms М, and M, are 
directional mass operators and are defined by 


1i+ror r, ltr, 1]7 > 
= 44- хх = - y — . 
M, | | , M, в}, 3 Я (А.4) 








A=} +r), В-2/(1-ғ,). 


The directional difference operators are given by 


0,054 , виа] 1 (Lr) І | 








Ах” “Ах Ax?' r,Ax?'rAx? 
0.5 0.5)7 1 (itr) 17 (A3) 
br apto] Pn sh eara] 
where r, and r, are grid growth ratios 
ain и y, Жат (A.6) 


x y , 


Ук 7 Ук-1 
so that on a uniform grid r, — r, — 1. 

It can be seen that there is a term-by-term correspondence between the original 
equation (A.1) and the discretised equation (А.З). The source of this correspondence 
сап be found by considering a single term 2Т/дх in (A.1). Application of the finite 
element method to (A.1) includes the following contribution from д Т/дх: 


1- X( Xo tasa) то. (A.7) 


where У denotes the contributions from the elements adjacent to node т. The 


- , 
Xj — Xi-1 


e 
Lagrange interpolating functions Фф, can be written as the product of one- 
dimensional interpolating functions 


фи = фф, (A.8) 
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where ф®, ф0? are given by (5.45 and 46). Consequently the contributions to the 
integral in (A.7) can be split into directional components 


I -X(X Jom eP dyf pp HE ix) T, . (А.9) 


As а result it is convenient to introduce the operators 








М, = B Y [o2 of? ay (A.10) 
Ду, 112 е 
апа 
А dg” 
Г. = (x) Tt , ДІ 


where it is assumed that the mth Galerkin node coincides with the global grid point 
(j, К), so that 


AXj-4j2=Xj—Xj-, and 4у 1)2=Ve-Ve-1 - (A.12) 


Thus the term ОТ/дх in (A.1) is discretised as М, © L, T. On the global grid, this 
has the form 


u T, Ту+а,к+1 7 1;-1,өе1 1 tr, Туу Tj ik 
M, @L,T= 4B| $d 24х |+ 3 24х 


1 11-17 1,6-1 
~~ fe ВВ . 1 
| 24х | (А13) 


Where second derivatives occur in the governing equation (А.1), application of 
integration by parts leads to the following definition of the operator L,,, at an 
interior point: 


Lys, = (SE) (SE as (A.14) 


Ах;-1/2% дх 4х. 








Comparable definitions to those given іп (А.10, 11 and 14) are available for M,, L, 
and L,,, respectively. In practice the integrals in (A.10), etc., are evaluated by 
introducing element-based coordinates (č, у) as in (5.58-60). For linear elements, 


фе -0.5(1--54,) with €,=+1 and —1<ё<1. (A.15) 


Equations (A.10, 11, 14) are applicabie for any order of Lagrange interpolation. For 
quadratic interpolation the operators іп (А.4, 5) would have five components 
associated with corner nodes and three components associated with mid-side 
nodes. 

It is apparent from (A.5) that the form of the L, and Г... operators is similar to 
finite difference discretisation. In contrast, the mass operators (A.4) arise from the 
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integral nature of the Galerkin method. Therefore, in a three-dimensional problem 
дТ/дх would be discretised as 


2Т/дх-«М,9М,ФІ,Т. (А.16) 


For brick elements with trilinear interpolation this implies that M, & М, © L; 
would be at most a 27-point operator. 

On a uniform grid there is a connection between the role of the mass operators 
and Padé differencing. A fourth-order evaluation of дТ/дх using Padé differencing 
is obtained by solving the tridiagonal system 


1 oT 2 oT 1 OT Tj+1—-Tj-) 
-( (Z) E) ашыта 1 
ax). 8c) 8 s ). 24x ' (^ 
ог 
M LLT.. (А.18) 
дх 


It тау be noted that here the mass operator is applied in the same direction as the 
derivative is taken. In contrast in the Galerkin finite element method, 


8T/8x-+M, @L,T , (A.19) 


and an explicit formula involving the transverse direction appears. However, on a 
uniform grid (A.19) is also a fourth-order accurate discretisation. 

To obtain fourth-order accuracy in discretising 02 Т/дх? using the equivalent 
of (A.18) or (A.19), it is necessary to generalise the form of the mass operator, 
as in (8.44) to M, = МІ = {5,1 — 20, ô}. The choice б = 1/12 gives fourth-order 
accuracy, Table 8.2. 
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